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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Atty. Docket No: 065691/0270 

In re patent application of 
PITIOT, GILLES et al . 
Serial No. 10/049,372 
Filed: February 12, 2002 

For: HUMAN ODORANT- BINDING PROTEINS WHICH BIND HYDROPHOBIC LIGANDS : 

POLYPEPTIDES AND POLYNUCLEOTIDES ENCODING SAID POLYPEPTIDES, AND USES 
THEREOF 

STATEMENT TO SUPPORT FILING AND SUBMISSION IN 
ACCORDANCE WITH 37 C.F.R. §§ 1.821-1.825 



Assistant Commissioner for Patents 
Washington, D.C. 20231 
Box SEQUENCE 

Sir: 

In connection with a Sequence Listing submitted concurrently 
herewith, the undersigned hereby states that: 

1. the submission, filed herewith in accordance with 37 

C.F.R. § 1.821(g), does not include new matter; 

2 . the content of the attached paper copy and the 

attached computer readable copy of the Sequence Listing, submitted in 
accordance with 37 C.F.R. § 1.821(c) and (e), respectively, are the same; 
and 

3. all statements made herein of their own knowledge are 

true and that all statements made on information and belief are believed to 
be true; and further, that these statements were made with the knowledge 
that willful false statements and the like so made are punishable by fine 
or imprisonment, or both, under Section 1001 of Title 18 of the United 
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Serial No. 10/049,372 

States Code and that such willful false statements may jeopardize the 
validity of the application or any patent resulting therefrom. 



HARBOR CONSULTING 

Intellectual Property Services 

1500A Lafayette Road 

Suite 262 

Portsmouth, N.H. 

800-318-3021 



Respectfully submitted, 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Application No. : 

U.S. National Serial No. : 

Filed : 

PCT International Application No. : PCT/FR00/02319 



I, the below named translator, hereby declare that: 
My name and post office address are as stated below; 

That I am knowledgeable in the French language in which the below identified international 
application was filed, and that, to the best of my knowledge and belief, the English translation 
of the amended sheets of the international application No. PCT/FR00/02319 is a true and 
complete translation of the amended sheets of the above identified international application as 



I hereby declare that all the statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code and that such willful false statements may jeopardize the validity of the patent 
application issued thereon. 



VERIFICATION OF A TRANSLATION 



filed. 



Date: February 28, 2002 




Full name of the translator : 



Elaine Patricia PARRISH 



For and on behalf of RWS Group pic 



Post Office Address : 



Europa House, Marsham Way, 
Gerrards Cross, Buckinghamshire, 
England. 
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Atty. Dkt. No. 065691/0270 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicants: Gilles PITIOT et al. 

Title: HUMAN ODORANT-BINDING PROTEINS WHICH BIND HYDROPHOBIC LIGANDS: 

POLYPEPTIDES AND POLYNUCLEOTIDES ENCODING SAID POLYPEPTIDES, 
AND USES THEREOF 

Appl. No. 10/049,372 

Filing Date: 08/15/2002 

Examiner: Unassigned 

Art Unit: Unassigned 



AMENDMENT IN RESPONSE TO NOTICE UNDER 37 CFR §§1.821-825 



Commissioner?for Patents 
Washington, D.C. 20231 

Sir: 

In response to the Notice to Comply With Requirements for Applications 
Containing Sequence Disclosures mailed April 16, 2002, please amend the application as 
follows: 



In the Specification: 

Please amend the specification as shown: 

Please delete the paragraph on page 6, lines 28-30, and replace it with the 
following paragraph: 

The polypeptides according to the invention are characterized 
in that they preferably comprise the Gly-Thr-Trp-Tyr (SEQ ID NO: 25) 
domain . 
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Please delete the paragraph on page 33, lines 6-15, and replace it with the 
following paragraph: 

The upper lines represent the hOBPIIa sequences (SEQ ID NOS 
17-19, 21-24) and the lower lines represent the hOBPIIb sequence 
(SEQ ID NO: 20) for which only the different nucleotides are 
represented; a dash indicates the absence of corresponding 
sequences. The shaded capitals are the exon sequences and the lower 
case letters are the intron sequences. The sizes indicated on the 
left are indicated in bp. The TATA box is in bold characters and 
the polyadenylation signal is underlined. The boxes indicate the 
splice acceptor sites for exons 5, 5b and 5c. 



Please delete the paragraph on page 33, lines 35-39 thru page 34 lines 1-4, and 
replace it with the following paragraph: 

FIGURE 5: Aligment of the protein sequences derived from the two 
human genes hOBPIIa and hOBPIIb (hOBPIIact = OB P 2 a a HOMS A (SEQ ID NO: 
2) , hOBPIIba = OBP2baHOMSA (SEQ ID NO: 10) , hOBPIIbp = OPB2bbHOMSA 
(residues 15-179 of SEQ ID NO: 12) , hOBPIIay = OBP2 agHOMSA (SEQ ID 
NO: 6), hOBPIIap = OB P2 abHOMS A (SEQ ID NO: 4)), of human tear 
lipocalins (LCNl_HOMSA) (SEQ ID NO: 46) , of rat OBPII 
(OBP2_RATNO) (SEQ ID NO: 47) , of bovine lactoglobulin BLG 
(LACB_BOSTA) (SEQ ID NO: 48) , of mouse MUP (MUP6_MUSMU) (SEQ ID NO: 
49) , of human RBP (RBP_HOMSA) (SEQ ID NO: 50) , of bovine OBP 
(OBP_BOSTA) (SEQ ID NO: 51) , of rat MUP (MUP_RATNO) (SEQ ID NO: 52) 
and of porcine OBP (OBP_SUSSC) (SEQ ID NO: 53) . 

Please delete the paragraph on page 34, lines 6-16 and replace it with the 
following paragraph: 

The residues in the dark grey boxes are identical and those in the 
light gray boxes are similar. The elements of secondary structure 
predicted with the DSC program are underlined and the amino acid 
residues are in italics. The |3-sheets and the a-helices are 
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numbered for hOPBIIa and b. The anticipated signal peptide cleavage 
site is indicated with an arrow (AAAjLS) at position 15. Nonaligned 
sequences of very divergent forms of spliced genes, hOPBa6 
<OPB2adHOMSA) (SEQ ID NO: 8) and hOBPby ( OBP2bgHOMSA) (SEQ ID NO: 14) 
have been added at the bottom of the alignment after the analysis. 

Please delete the paragraph on page 39, lines 30-38 thru page 40, lines 1-2, and 
replace it with the following paragraph: 

A Lambda gtll human testes cDNA library (Clontech) (10 7 p.f.u.) was 
amplified by 30 polymerase chain reaction (PCR) cycles (94°C 45 sec, 
54°C 45 sec, 72°C 1 min 30 sec) with the primer oliEST58 
CCTGCAGGTACATGAGCTTCC (SEQ ID NO: 26) and 5' or 3' screening 
amplimers for inserts located on the arms of the lambda gtll 
vectors. A nested PCR was then carried out with oliEST26 
CGCTGTATTTGCCAGGCTCC (SEQ ID NO: 27) and oligonucleotides specific 
for the arm of the vector. The PCR products were subcloned into the 
pGEM-T (r) vector, which made it possible to obtain the 5' end of 
the hOBPII gene cDNAs . 



Please delete the paragraph on page 40, lines 20-39 thru page 41, lines 1-5, and 
replace it with the following paragraph: 

Tissue samples were collected from 45 to 55 year old Caucasian 
individuals, in agreement with the French regulations in force. The 
total RNA is extracted according to a single-step method using the 
RNA NOW® reagent, according to the manufacturer's (Biogentex) 
instructions. 5 ug of total RNA were reversed-transcribed in a 
final volume of 20 ul using 0.5 ng of oligonucleotide 
GACTCGAGTCGACATCGATTTTTTTTTTTTTTTTT (SEQ ID NO: 28) with the 
Superscript® preampli f ication system (Gibco BRL) . Three pi of these 
reactions were then used for the following PCRs . The expression of 
the specific mRNAs was determined by PCR using: the primers TL: 
CCTCTCCCAGCCCCAGCAAG (SEQ ID NO: 29) and AP: GACTCGAGTCGACATCG (SEQ 
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ID NO: 30) for the LCNl-type genes (LCN1, LCNlb, LCNlc) and, for the 
hOBPII-type genes, the primers DE : CGCCCAGTGACCTGCCGAGGTC (SEQ ID 
NO: 31), and FI : CTTTATTTGGAGTCAGGTGGGTG (SEQ ID NO: 32). As a 
quality control for the RNA' s the primers G3PDH1: 
CTCTGCCCCCTCTGCTGATG (SEQ ID NO: 33) and G3PDH2 : 

CCTGCTTCACCACCTTCTTG (SEQ ID NO: 34) for the G3PDH gene were used; 
the G3PDH gene is considered to be constitut ively expressed in all 
cell types. 32 PCR cycles (94°C 45 sec, 54°C 45 sec, 72°C 2 min 30 
sec) were performed and the amplification products were separated on 
1% agarose gel. The DNA was transferred onto a Hybond N+® membrane. 



Please delete the paragraph on page 41 , lines 7-1 6, and replace it with the 
following paragraph: 

For the detection of the expression of the various genes, several oligonucleotides specific for the 
respective genes were synthesized: 

- olLCNl: GACTCAGACTCCGGAGATGA (SEQ ID NO: 35), 

- olLCNlb: AACTCAGACACCAGAGATGA (SEQ ID NO: 36), 

- olLCNlc: GACTCAGATCCCGGAGATGA (SEQ ID NO: 37), 

- ol5: CCAGGAGGGACCACTACA (SEQ ID NO: 38) specific for the hOBPIIb gene, 

- o!4: CCGGGACGGACGACTACG (SEQ ID NO: 39) specific for the hOBPIIa gene, 

- G3PDH3: CTCATGACCACAGTCCATGC (SEQ ID NO: 40). 

Please delete the paragraph on page 41 , lines 30-39 thru page 42, lines1-7, and 
replace it with the following paragraph: 

The genotyping is carried out with PCR reactions using 100 ng of 
genomic DNA originating from 8 CEPH reference families, and using 
the oligonucleotides oli9 TGTTCGGGAACGCAGCTT (SEQ ID NO: 41) and 
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olilOa TGCCGCTGTCCCCACGTCGG (SEQ ID NO: 42) . The thermocycler 
parameters consist of an initial cycle at 94°C for 10 min followed 
by 30 cycles at 94°C for 30 sec, 55°C for 30 sec and 70°C for 45 
sec, and then a final elongation step of 10 min at 70°C. The PCR 
products are then analyzed on a 3% agarose gel. The information 
regarding the markers for chromosome 9 may be obtained at the 
following internet address ( hTTP : / /gal ton .ucl.ac.uk ) ; the analyses 
were carried out using the linkage study tools previously described 
in Lacazette et al. (1997). The haplotypes are reconstructed 
manually according to the recombination events previously described 
in family 1362 (Attwood et al., 1994). 

Please delete the paragraph on page 52, lines 34-39 thru page 54, Iines1-1 1 , and 
replace it with the following paragraph: 

In the case of the hOBPIIb gene, in addition to the hOBPIIba mRNA 
previously described, a 106 bp super-numerary exon (exon 3b) between 
the previous exons 3 and 4 was identified (figure 3) . This longer 
mRNA (782 nucleotides) encodes a 165 amino acid protein hOBPIIbp. 
From a protein structure point of view, hOBPIIbp is identical to 
hOBPIIba up to the 5 th putative p-sheet and then differs due to a 
reading frame shift. The predictions from the computer programs 
indicate that the motif ALWEALAI DTRLK (SEQ ID NO: 43) is an a-helix 
which is just behind the fifth p-sheet. Two additional p-sheets may 
be present in the long C-terminal portion. 

Please delete the paragraph on page 58, lines 36-39 thru page 59, lines'! -23, and 
replace it with the following paragraph: 

A PCR on the plasmid DNA of an hOBPIIba clone using the primers 
Blla/b (5' GTC GGA TCC CTG TCC TTC ACC CTG GAG G 3' ) (SEQ ID NO: 
44), a sense oligonucleotide beginning 45 bases after the protein- 
initiating ATG, and Xllb (5; GTC CTC GAG GTG TTC GGG AAC GCA GCT TC 
3' ) (SEQ ID NO: 45) , an antisense oligonucleotide preceding the stop 
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codon of the hOBPIIba protein, made it possible to amplify all of 
the DNA encoding the secreted hOBPIIba protein. The BanH I and Xho 
I enzymatic restriction sites located at the ends of the two 
oligonucleotides (bases underlined) were used for directional 
cloning into a plasmid expression vector pGEX-6Pl, followed by 
transformation by electroporat ion (1800 V, 200 Q, 25 uF) into 
bacterial strain BL21. Synthesis of the recombinant protein is 
obtained by adding IPTG, at 5 mM final for 3 h, to 250 ml of culture 
of the strain in LB medium containing 100 ug/ml of ampicillin pre- 
incubated at 37°C for 2 h. The centrifuged cultures are taken up in 
25 ml of TENGN buffer. The lysate is sonicated and recentri f uged . 
The fusion protein is then purified using 4 ml of beads bearing 
covalently attached insoluble glutathione (Sigma) per 25 ml of 
supernatant. After incubation for 4 h, they are washed with 3 
volumes of 1M NaCl and then with 10 volumes of 1 X PBS. The elution 
is obtained by bringing the beads into contact with a glutathione 
solution (10 mM reduced glutathione, 50 mM Tris-HCl, pH 8.0) for 10 
min . 
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REMARKS 

Applicants believe that the present application is now in condition for allowance. 
Favorable reconsideration of the application as amended is respectfully requested. 

The Examiner is invited to contact the undersigned by telephone if it is felt that a 
telephone interview would advance the prosecution of the present application. 

Respectfully submitted. 



Date August 14, 2002 




FOLEY & LARDNER 
Customer Number: 22428 

III ll l III ll I II 

22428 

Telephone: (202) 672-5407 
Facsimile: (202) 672-5399 



Stephen Maebius 
Attorney for Applicants 
Registration No. 35,264 



Should additional fees be necessary in connection with the filing of this paper, or if a petition for 
extension of time is required for timely acceptance of same, the Commissioner is hereby 
authorized to charge Deposit Account No. 19-0741 for any such fees; and applicant(s) hereby 
petition for any needed extension of time. 
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MARKED UP VERSION ATTACHED TO AMENDMENT IN 

SERIAL NO. 10/049.372 

Marked up version of the paragraph on page 6, lines 28-30 is below: 

The polypeptides according to the invention are characterized in 
that they preferably comprise the Gly-Thr-Trp-Tyr (SEQ ID NO: 25) 
domain . 

Marked up version of the paragraph on page 33, lines 6-15, is below: 

The upper lines represent the hOBPIIa sequences (SEQ ID NOS 17-19, 
21-24) and the lower lines represent the hOBPIIb sequence (SEQ ID 
NO: 20) for which only the different nucleotides are represented; a 
dash indicates the absence of corresponding sequences. The shaded 
capitals are the exon sequences and the lower case letters are the 
intron sequences. The sizes indicated on the left are indicated in 
bp. The TATA box is in bold characters and the polyadenylat ion 
signal is underlined. The boxes indicate the splice acceptor sites 
for exons 5, 5b and 5c. 

Marked up version of the paragraph on page 33, lines 35-39 thru page 34 lines 
1-4, is below: 

FIGURE 5 : Aligment of the protein sequences derived, from the two 
human genes hOBPIIa and hOBPIIb (hOBPIIacc = OBP2aaHOMSA (SEQ ID NO: 
2) , hOBPIIbot = OBP2baHOMSA (SEQ ID NO: 10) , hOBPIIbp = OPB2bbHOMSA 

(residues 15-179 of SEQ ID NO: 12) , hOBPIIay = OBP2 ag/HOMSA (SEQ ID 
NO: 6) , hOBPIIap = OBP2 abHOMSA (SEQ ID NO: 4) ) , of human tear 
lipocalins (LCNl HOMSA) (SEQ ID NO: 46) , of rat OBPII 

(OBP2 RATNO) (SEQ ID NO: 47) , of bovine lactoglobulin BLG 

(LACB BOSTA) (SEQ ID NO: 48) , of mouse MUP (MUP6 MUSMU) (SEQ ID NO: 
49) , of human RBP (RBP HOMSA) (SEQ ID NO: 50) , of bovine OBP 

(OBP BOSTA) (SEQ ID NO: 51) , of rat MUP (MUP RATNO) (SEQ ID NO: 52) 
and of porcine OBP (OBP SUSSC) (SEQ ID NO: 53) . 
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Marked up version of the paragraph on page 34, lines 6-16, is below: 

The residues in the dark grey boxes are identical and those in the 
light gray boxes are similar. The elements of secondary structure 
predicted with the DSC program are underlined and the amino acid 
residues are in italics. The (3-sheets and the a-helices are 
numbered for hOPBIIa and b. The anticipated signal peptide cleavage 
site is indicated with an arrow (AAA^LS) at position 15. Nonaligned 
sequences of very divergent forms of spliced genes, hOPBa6 
(0PB2adH0MSA) (SEQ ID NO: 8) and hOBPby (OBP2bgHOMSA) (SEQ ID NO: 14) 
have been added at the bottom of the alignment after the analysis. 

Marked up version of the paragraph on page 39, lines 30-38 thru page 40, lines 
1-2, is below: 

A Lambda gtll human testes cDNA library (Clontech) (10 7 p.f.u.) was 
amplified by 30 polymerase chain reaction (PCR) cycles (94°C 45 sec, 
54°C 45 sec, 72°C 1 min 30 sec) with the primer oliEST58 
CCTGCAGGTACATGAGCTTCC (SEQ ID NO: 26) and 5' or 3' screening 
amplimers for inserts located on the arms of the lambda gtll 
vectors. A nested PCR was then carried out with oliEST26 
CGCTGTATTTGCCAGGCTCC (SEQ ID NO: 27) and oligonucleotides specific 
for the arm of the vector. The PCR products were subcloned into the 
pGEM-T (r) vector, which made it possible to obtain the 5' end of 
the hOBPII gene cDNAs . 

Marked up version of the paragraph on page 40, lines 20-39 thru page 41, lines 
1-5, is below: 

Tissue samples were collected from 45 to 55 year old Caucasian 
individuals, in agreement with the French regulations in force. The 
total RNA is extracted according to a single-step method using the 
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RNA NOW® reagent, according to the manufacturer's (Biogentex) 
instructions. 5 ug of total RNA were reversed-transcribed in a 
final volume of 20 ul using 0.5 ng of oligonucleotide 
GACTCGAGTCGACATCGATTTTTTTTTTTTTTTTT (SEQ ID NO: 28) with the 
Superscript® preamplif ication system (Gibco BRL) . Three ul of these 
reactions were then used for the following PCRs . The expression of 
the specific mRNAs was determined by PCR using: the primers TL : 
CCTCTCCCAGCCCCAGCAAG (SEQ ID NO: 29) and AP: GACTCGAGTCGACATCG (SEQ 
ID NO: 30) for the LCNl-type genes (LCN1, LCNlb, LCNlc) and, for the 
hOBPII-type genes, the primers DE : CGCCCAGTGACCTGCCGAGGTC (SEQ ID 
NO: 31) , and FI : CTTTATTTGGAGTCAGGTGGGTG (SEQ ID NO: 32) . As a 
quality control for the RNA' s the primers G3PDH1: 
CTCTGCCCCCTCTGCTGATG (SEQ ID NO: 33) and G3PDH2: 

CCTGCTTCACCACCTTCTTG (SEQ ID NO: 34) for the G3PDH gene were used; 
the G3PDH gene is considered to be cons titutively expressed in all 
cell types. 32 PCR cycles (94°C 45 sec, 54°C 45 sec, 72°C 2 rain 30 
sec) were performed and the amplification products were separated on 
1% agarose gel. The DNA was transferred onto a Hybond N+® membrane. 



Marked up version of the paragraph on page 41, lines 6-16, is below: 

For the detection of the expression of the various genes, several 
oligonucleotides specific for the respective genes were synthesized: 

- olLCNl : GACTCAGACTCCGGAGATGA (SEQ ID NO: 35) , 

- olLCNlb: AAC T C AGAC AC C AGAG AT GA (SEQ ID NO: 36) , 

- olLCNlc: G AC T C AG AT C C C G GAG AT G A (SEQ ID NO : 37) , 

- ol5: CCAGGAGGGACCACTACA (SEQ ID NO: 38) specific for the hOBPIIb 
gene , 

- ol4: CCGGGACGGACGACTACG (SEQ ID NO: 39) specific for the hOBPIIa 
gene, 
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- G3PDH3: CTCATGACCACAGTCCATGC (SEQ ID NO: 40) . 

Marked up version of the paragraph on page 41, lines 30-39 thru page 42, 
lines1-7, is below: 

The genotyping is carried out with PCR reactions using 100 ng of 
genomic DNA originating from 8 CEPH reference families, and using 
the oligonucleotides oli9 TGTTCGGGAACGCAGCTT (SEQ ID NO: 41) and 
olilOa TGCCGCTGTCCCCACGTCGG (SEQ ID NO: 42) . The thermocycler 
parameters consist of an initial cycle at 94°C for 10 min followed 
by 30 cycles at 94°C for 30 sec, 55°C for 30 sec and 70°C for 45 
sec, and then a final elongation step of 10 min at 70°C. The PCR 
products are then analyzed on a 3% agarose gel. The information 
regarding the markers for chromosome 9 may be obtained at the 
following internet address ( hTTP : //gal ton .ucl.ac.uk ) ; the analyses 
were carried out using the linkage study tools previously described 
in Lacazette et al. (1997). The haplotypes are reconstructed 
manually according to the recombination events previously described 
in family 1362 (Attwood et al., 1994). 



Marked up version of the paragraph on page 52, lines 34-39 thru page 54, 
linesl-11, is below: 

In the case of the hOBPIIb gene, in addition to the hOBPIIba mRNA 
previously described, a 106 bp super-numerary exon (exon 3b) between 
the previous exons 3 and 4 was identified (figure 3) . This longer 
mRNA (782 nucleotides) encodes a 165 amino acid protein hOBPIIb(3. 
From a protein structure point of view, hOBPIIb(3 is identical to 
hOBPIIba up to the 5 th putative (3-sheet and then differs due to a 
reading frame shift. The predictions from the computer programs 
indicate that the motif ALWEALAI DTRLK (SEQ ID NO: 43) is an a-helix 
which is just behind the fifth |3-sheet. Two additional |3-sheets may 
be present in the long C-terminal portion. 
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Marked up version of the paragraph on page 58, lines 36-39 thru page 59, 
lines'! -23, is below: 

A PCR on the plasmid DNA of an hOBPIIba clone using the primers 
Blla/b (5' GTC GGA TCC CTG TCC TTC ACC CTG GAG G 3') (SEQ ID NO: 
44) , a sense oligonucleotide beginning 45 bases after the protein- 
initiating ATG , and Xllb (5; GTC CTC GAG GTG TTC GGG AAC GCA GCT TC 
3' ) (SEQ ID NO: 45) , an antisense oligonucleotide preceding the stop 
codon of the hOBPIIba protein, made it possible to amplify all of 
the DNA encoding the secreted hOBPIIba protein. The BanH I and Xho 
I enzymatic restriction sites located at the ends of the two 
oligonucleotides (bases underlined) were used for directional 
cloning into a plasmid expression vector pGEX-6Pl, followed by 
transformation by electroporation (1800 V, 200 Q, 25 uF) into 
bacterial strain BL21. Synthesis of the recombinant protein is 
obtained by adding IPTG, at 5 mM final for 3 h, to 250 ml of culture 
of the strain in LB medium containing 100 ug/ml of ampicillin pre- 
incubated at 37 °C for 2 h. The centrifuged cultures are taken up in 
25 ml of TENGN buffer. The lysate is sonicated and recentr i f uged . 
The fusion protein is then purified using 4 ml of beads bearing 
covalently attached insoluble glutathione (Sigma) per 25 ml of 
supernatant. After incubation for 4 h, they are washed with 3 
volumes of 1M NaCl and then with 10 volumes of 1 X PBS. The elution 
is obtained by bringing the beads into contact with a glutathione 
solution (10 mM reduced glutathione, 50 mM Tris-HCl, pH 8.0) for 10 
min . 
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CLAIMS 

1. An isolated polypeptide comprising an amino acid 
sequence having at least 90% identity with the amino 
acid sequences SEQ ID No. 2, SEQ ID No . 4 , SEQ ID 
No. 6, SEQ ID No. 8, SEQ ID No. 12 or SEQ ID No. 14. 

2. An isolated polypeptide, characterized in that it 
comprises a polypeptide chosen from: 

a) a polypeptide of sequence SEQ ID No. 2, SEQ ID No. 4, 
SEQ ID No. 6, SEQ ID No . 8, SEQ ID No. 12 or SEQ ID 
No. 14; 

b) a polypeptide which is a variant of a polypeptide of 
amino acid sequences defined in a) ; 

c) a polypeptide homologous to the polypeptide defined 
in a) or b) and comprising at least 90% identity 
with said polypeptide of a) ; 

d) a fragment of at least 15 consecutive amino acids of 
a polypeptide defined in a) , b) or c) ; 

e) a biologically active fragment of a polypeptide 
defined in a) , b) or c) . 

3. An isolated polypeptide selected from a poly- 
peptide corresponding to the sequence SEQ ID No. 2 and 
named OBPII acX/ to the sequence SEQ ID No. 4 and named 
0BPII a p, to the sequence SEQ ID No . 6 and named OBPII ay , 
and to the sequence SEQ ID No. 12 and named OBPII b p. 

4. The polypeptide as claimed in any one of claims 1 
to 3, characterized in that it comprises at least the 
Gly-Thr-Trp-Tyr domain. 
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5. An isolated polynucleotide, characterized in that 
it encodes a polypeptide as claimed in any one of 
claims 1 to 4 . 

6. The polynucleotide as claimed in claim 5, of 
sequence SEQ ID No. 1, SEQ ID No . 3, SEQ ID No . 5, SEQ 
ID No. 7, SEQ ID No. 11 or SEQ ID No. 13. 

7. An isolated polynucleotide, characterized in that 
it comprises a polynucleotide chosen from: 

a) a polynucleotide of sequence SEQ ID No. 1, SEQ ID 
No. 3, SEQ ID No. 5, SEQ ID No . 7, SEQ ID No. 11 or 
SEQ ID No. 13, or the sequence of which is that of 
the RNA corresponding to the sequence SEQ ID No . 1 , 
SEQ ID No. 3, SEQ ID No. 5, SEQ ID No. 7, SEQ ID 
No. 11 or SEQ ID No. 13; 

b) a polynucleotide, the sequence of which is comple- 
mentary to the sequence of a polynucleotide defined 
in a) , 

c) a polynucleotide, the sequence of which comprises at 
least 80% identity with a polynucleotide defined in 
a) or b) , 

d) a fragment of at least 15 consecutive nucleotides, 
in preference 21 consecutive nucleotides, and 
preferably 3 0 consecutive nucleotides, of a poly- 
nucleotide defined in a) , b) , c) or d) . 

8. The polynucleotide as claimed in claim 7, charac- 
terized in that it is labeled directly or indirectly 
with a radioactive compound or a nonradioactive 
compound. 

9. The use of a polynucleotide as claimed in claim 8, 
as a probe for detecting nucleic acid sequences. 
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10. The use of a polynucleotide as claimed in any one 
of claims 5 to 8, as a primer for amplifying or 
polymerizing nucleic acid sequences. 

5 

11. The use of a polynucleotide as claimed in claim 8, 
as a sense or antisense oligonucleotide for controlling 
the expression of the corresponding protein product. 

10 12 . A recombinant vector for cloning a polynucleotide 
as claimed in one of claims 5 to 8 and/or for 
expressing a polypeptide as claimed in one of claims 1 
to 4, characterized in that it contains a poly- 
nucleotide as claimed in any one of claims 5 to 8 . 

15 

13. The vector as claimed in claim 12, characterized 
in that it comprises the elements which allow the 
expression, optionally the secretion, of said poly- 
peptide in a host cell. 

20 

14. The vector as claimed in any one of claims 12 to 
13, characterized in that the elements which allow the 
expression of said polypeptide are chosen from: 

25 a) the isolated polynucleotide of sequence SEQ ID 
No. 15 and SEQ ID No. 16; 

b) a polynucleotide, the sequence of which is comple- 
mentary to the sequence of the polynucleotide 

30 defined in a) ; 

c) a polynucleotide, the sequence of which comprises at 
least 80% identity with a polynucleotide defined in 
a) or in b) ; 

35 

d) a polynucleotide which hybridizes, under conditions 
of high stringency, with a sequence of the poly- 
nucleotide defined in a) , b) or c) . 
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15. The vector as claimed in claims 13 and 14, for 
expression in eukaryotic cells, selected from viral DNA 
and naked DNA. 

5 

16. A host cell, characterized in that it is trans- 
formed with a vector as claimed in one of claims 12 to 
15 . 

10 17. A process for preparing a recombinant polypeptide, 
characterized in that a host cell as claimed in 
claim 16 is cultured under conditions which allow the 
expression and, optionally, the secretion of said 
recombinant polypeptide, and in that said recombinant 

15 polypeptide is recovered. 

18 . A recombinant polypeptide which can be obtained 
using a process as claimed in claim 17 . 

20 19. The use of a polypeptide chosen from a polypeptide 
as claimed in any one of claims 1 to 4 and 18, or a 
fragment thereof, as a protein which binds to a hydro- 
phobic ligand, preferably an odorous molecule. 

25 20. The use of a polypeptide as claimed in any one of 
claims 1 to 4 and 18, or of the polypeptide hOBPII bct of 
sequence SEQ ID No. 10, a polypeptide which is a 
variant of said polypeptide hOBPII bct , a polypeptide 
homologous to said polypeptide hOBPIIba comprising at 

3 0 least 90% identity with said polypeptide hOBPII ba or a 
fragment thereof, as a competitive inhibitor, as an 
agonist or as an antagonist of the cellular receptors 
for lipocalins. 

35 21. A monoclonal or polyclonal antibody, and fragments 
thereof, characterized in that it is specifically 
against an isolated polypeptide as claimed in one of 
claims 1 to 4 and 18 . 
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22. The use of an antibody as claimed in claim 21, for 
demonstrating the presence of a polypeptide as claimed 
in one of claims 1 to 4 and 18 in a biological sample. 



23. A process for detecting an antibody against hOBPII 
in human serum from an allergic and/or asthmatic 
patient, using an hOBPII polypeptide. 

24. A process for controlling the volatilization of an 
odorant, characterized in that it comprises a step of 
binding of said odorant with a polypeptide as claimed 
in any one of claims 1 to 4 and 18, or with the poly- 
peptide hOBPII ba of sequence SEQ ID No. 10, a poly- 
peptide which is a variant of said polypeptide hOBPIIba 
or a polypeptide homologous to said polypeptide hOBPIIba 
comprising at least 9 0% identity with said polypeptide 
hOBPIIba - 



25. The process as claimed in claim 24, characterized 
in that the polypeptide is bound to a solid support. 

26. The process as claimed in claim 24, characterized 
in that the polypeptide is in a liquid composition. 

27. The process as claimed in claim 26, characterized 
in that said composition is a fragranced composition 
for the skin. 



28. A process for screening a molecule, preferably 
odorants or flavors, which comprises passing the 
molecule over a substrate which comprises a polypeptide 
as claimed in any one of claims 1 to 4 and 18, or the 
polypeptide hOBPII ba of sequence SEQ ID No . 10, a poly- 
peptide which is a variant of said polypeptide hOBPIIba 
or a polypeptide homologous to said polypeptide hOBPII ba 
comprising at least 90% identity with said polypeptide 
hOBPII b a, bound to said substrate, said polypeptide 
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binding said odorant or flavor, and recovering said 
odorant or flavor from the polypeptide if necessary. 

29. The process as claimed in claim 28, characterized 
5 in that the molecules are human pheromones . 

30. A process for solubilizing lipophilic molecules, 
characterized in that it comprises binding said 
molecule to a polypeptide as claimed in any one of 

10 claims 1 to 4 and 18, or to the polypeptide hOBPII ba of 
sequence SEQ ID No. 10, a polypeptide which is a 
variant of said polypeptide hOBPII bct or a polypeptide 
homologous to said polypeptide hOBPII ba comprising at 
least 90% identity with said polypeptide hOBPII ba . 

15 

31. The application of the polypeptides as claimed in 
one of claims 1 to 4 and 18, or of the polypeptide 
hOBPII ba of sequence SEQ ID No. 10, of a polypeptide 
which is a variant of said polypeptide hOBPII ba or of a 

2 0 polypeptide homologous to said polypeptide hOBPIl ba 

comprising at least 90% identity with said polypeptide 
hOBPII ba , in combination with dietary fatty acids, as a 
food supplement. 

25 32. The application as claimed in claim 31, for 
preparing a medicinal product intended for the 
treatment of hyperlipidemias and obesity. 

33. The application as claimed in claim 31, for 
30 supplementing nonmaternal milks. 

34. The polypeptide as claimed in any one of claims 1 
to 4 and 18, or the polypeptide hOBPIIfaa of sequence SEQ 
ID No. 10, a polypeptide which is variant of said poly- 

3 5 peptide hOBPII ba or a polypeptide homologous to said 

polypeptide hOBPIIba comprising at least 90% identity 
with said polypeptide hOBPIIba, as an agent for 
targeting a pharmaceutical compound. 
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35. The polypeptide as claimed in claim 34, charac- 
terized in that said polypeptide is expressed in the 
form of a protein from fusion with a protein which 

5 allows specific cellular addressing. 

36. The polypeptide as claimed in claim 35, charac- 
terized in that said protein which allows specific 
cellular addressing is chosen from the group composed 

10 of interleukins , of cytokines, of lymphokines, of 
interferons, of growth factors, of hormones and of 
antibodies . 

37. The polypeptide as claimed in claim 34, charac- 
15 terized in that said polypeptide is combined with a 

molecule which allows specific cellular addressing. 

38. The polypeptide as claimed in claim 37, charac- 
terized in that said molecule which allows specific 

20 cellular addressing is chosen from the group composed 
of steroids, of interleukins, of cytokines, of lympho- 
kines, of interferons, of growth factors, of hormones 
and of antibodies . 

25 39. The polypeptide as claimed in any one of claims 1 
to 4 and 18, or the polypeptide hOBPII bot of sequence SEQ 
ID No. 10, a polypeptide which is a variant of said 
polypeptide hOBPIIba or a polypeptide homologous to said 
polypeptide hOBPII ba comprising at least 90% identity 

3 0 with said polypeptide hOBPII bot , as a pharmaceutical 
compound transporter. 

40. A pharmaceutical composition comprising a pharma- 
ceutical compound bound at least to a polypeptide as 

35 claimed in one of claims 3 4 to 39, and a pharma- 
ceutically acceptable vehicle. 

41. The pharmaceutical composition as claimed in 
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claim 40, characterized in that the pharmaceutical 
compound is chosen from the group of anticancer agents. 

42 . The pharmaceutical composition as claimed in 
5 claim 41, characterized in that said anticancer agent 
is a radioactive isotope chosen from the group: 
iodine 131 , yttrium 90 , gold 199 , palladium 100 , copper 67 , 
bismuth 217 and antimony 211 . 

10 43 . The pharmaceutical composition as claimed in any 
one of claims 40 to 42, characterized in that said 
polypeptide as claimed in any one of claims 34 to 39 
constitutes a delayed form of delivery of said 
pharmaceutical compound in the body. 

15 

44. A pharmaceutical composition comprising an 
expression vector as claimed in claim 12 or 13, and a 
pharmaceutically acceptable vehicle. 

20 45. The pharmaceutical composition as claimed in any 
one of claims 40 to 44, for treating cancer preferably 
chosen from breast cancer, uterine cancer, prostate 
cancer, liver cancer and pulmonary epithelial cell 
carcinoma . 

25 

46. The use of a polypeptide chosen from the poly- 
peptide hOBPII bot of sequence SEQ ID No. 10, a poly- 
peptide which is a variant of said polypeptide hOBPII bot 
and a polypeptide homologous to said polypeptide 

30 hOBPIIba comprising at least 90% identity with said 
polypeptide hOBPII ba , for preparing a medicinal product 
intended for the treatment of uterine cancer, prostate 
cancer, liver cancer and pulmonary epithelial cell 
cancer . 

35 

47 . The use of the polypeptide as claimed in any one 
of claims 1 to 4 and 18, or the polypeptide hOBPII bot of 
sequence SEQ ID No. 10, or a polypeptide which is a 
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variant of said polypeptide hOBPII bC( or a polypeptide 
homologous to said polypeptide hOBPII ba comprising at 
least 90% identity with said polypeptide hOBPII ba> for 
preparing a medicinal product intended for the 
5 transport of a compound across the placental barrier. 

48. The use as claimed in claim 47, characterized in 
that said transport is carried out from the gestating 
mother to the fetus, and in that said compound is 

10 chosen from hormones, essential fatty acids, lipophilic 
medicinal products and vitamins. 

49. The use as claimed in claim 47, characterized in 
that said transport is carried out from the fetus to 

15 the mother and is intended for the detoxification of 
the fetus . 

50. The polypeptide as claimed in claims 1 to 4 and 
18, or the polypeptide hOBPIIba of sequence SEQ ID 

2 0 No. 10, a polypeptide which is a variant of said poly- 
peptide hOBPIIba or- a polypeptide homologous to said 
polypeptide hOBPII btx comprising at least 9 0% identity 
with said polypeptide hOBPII bce , as a pregnancy marker. 

25 51. The polypeptide as claimed in claims 1 to 4 and 
18, or the polypeptide hOBPII ba of sequence SEQ ID 
No. 10, a polypeptide which is a variant of said poly- 
peptide hOBPIIha or a polypeptide homologous to said 
polypeptide hOBPIIfaa comprising at least 90% identity 

30 with said polypeptide hOBPII ba , as a marker for a feto- 
placental pathological condition 
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SEQUENCE LISTING 

<110> PITIOT, GILLES 
LACAZETTE, ERIC 
GACHON , FRANCO I SE 



<120> HUMAN ODORANT- BINDING PROTEINS WHICH BIND HYDROPHOBIC 

LIGANDS : POLYPEPTIDES AND POLYNUCLEOTIDES ENCODING SAID 
POLYPEPTIDES, AND USES THEREOF 

<130> 065691/0270 



<140> 10/049,372 
<141> 2002-02-12 



<150> PCT/FR00/02319 
<151> 2000-08-11 



<150> FR 99 10439 
<151> 1999-08-12 



<160> 53 



<170> Patentln Ver . 2.1 



<210> 1 

<211> 676 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43) . . (552) 



<220> 

<223> CDNA396 (676) /gl (hOBPIIa-alpha) 



<400> 1 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag atg aag acc ctg 54 

Met Lys Thr Leu 



ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 102 
Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 



gag gag gag gat ate aca ggg acc tgg tac gtg aag gec atg gtg gtc 150 
Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys Ala Met Val Val 
25 30 35 



gat aag gac ttt ccg gag gac agg agg ccc agg aag gtg tec cca gtg 198 
Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser Pro Val 



aag gtg aca gee ctg ggc ggt ggg aac ttg gaa gec acg ttc acc ttc 246 
Lys Val Thr Ala Leu Gly Gly Gly Asn Leu Glu Ala Thr Phe Thr Phe 
55 60 65 



2 



atg agg gag gat egg tgc ate cag aag aaa ate ctg atg egg aag acg 2 94 
Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu Met Arg Lys Thr 
70 75 80 



gag gag cct ggc aaa ttc age gee tat ggg ggc agg aag etc ata tac 342 
Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg Lys Leu lie Tyr 
85 90 95 100 



ctg cag gag ctg ccc ggg acg gac gac tac gtc ttt tac tgc aaa gac 
Leu Gin Glu Leu Pro Gly Thr Asp Asp Tyr Val Phe Tyr Cys Lys Asp 
105 110 " 115 



cag cgc cgt ggg ggc ctg cgc tac atg gga aag ctt gtg ggt agg aat 438 
Gin Arg Arg Gly Gly Leu Arg Tyr Met Gly Lys Leu Val Gly Arg Asn 
120 125 130 



cct aat acc aac ctg gag gec ctg gaa gaa ttt aag aaa ttg gtg cag 486 
Pro Asn Thr Asn Leu Glu Ala Leu Glu Glu Phe Lys Lys Leu Val Gin 
135 140 145 



cac aag gga etc teg gag gag gac att ttc atg ccc ctg cag acg gga 534 
His Lys Gly Leu Ser Glu Glu Asp lie Phe Met Pro Leu Gin Thr Gly 
150 155 160 

age tgc gtt etc gaa cac taggcagccc ccgggtctgc acctccagag 582 
Ser Cys Val Leu Glu His 
165 170 

cccaccctac caccagacac agagecegga ccacctggac ctaccctcca gccatgaccc 642 
ttccctgctc ccacccacct gactccaaat aaag 676 



<210> 2 
<211> 170 
<212> PRT 

<213> Homo sapiens 



<400> 2 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 
15 10 15 



Ser Phe Thr Leu Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys 

20 25 30 

Ala Met Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys 
35 40 45 

Val Ser Pro Val Lys Val Thr Ala Leu Gly Gly Gly Asn Leu Glu Ala 
50 55 60 

Thr Phe Thr Phe Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu 
65 70 75 80 

Met Arg Lys Thr Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg 
85 90 95 



3 



Lys Leu lie Tyr 
100 

Tyr Cys Lys Asp 
115 

Val Gly Arg Asn 
130 

Lys Leu Val Gin 
145 

Leu Gin Thr Gly 



Leu Gin Glu Leu 



Gin Arg Arg Gly 
120 

Pro Asn Thr Asn 
135 

His Lys Gly Leu 
150 

Ser Cys Val Leu 
165 



Pro Gly Thr Asp 
105 

Gly Leu Arg Tyr 



Leu Glu Ala Leu 
140 

Ser Glu Glu Asp 
155 

Glu His 
170 



Asp Tyr Val Phe 
110 

Met Gly Lys Leu 
125 

Glu Glu Phe Lys 



lie Phe Met Pro 
160 



<210> 3 

<211> 725 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43) . . (480) 

<220> 

<223> CDNA396 (725) /SM12 (hOBPIIa-beta) 
<400> 3 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag atg aag acc ctg 54 

Met Lys Thr Leu 
1 

ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 102 
Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 
5 10 15 20 

gag gag gag gat ate aca ggg acc tgg tac gtg aag gee atg gtg gtc 15 0 
Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys Ala Met Val Val 
25 30 35 

gat aag gac ttt ccg gag gac agg agg ccc agg aag gtg tec cca gtg 198 
Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser Pro Val 
40 45 50 

aag gtg aca gec ctg ggc ggt ggg aac ttg gaa gee acg ttc acc ttc 246 
Lys Val Thr Ala Leu Gly Gly Gly Asn Leu Glu Ala Thr Phe Thr Phe 
55 60 65 

atg agg gag gat egg tgc ate cag aag aaa ate ctg atg egg aag acg 294 
Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu Met Arg Lys Thr 
70 75 80 



gag gag cct ggc aaa ttc age gee tat ggg ggc agg aag etc ata tac 342 
Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg Lys Leu lie Tyr 
85 90 95 100 



ctg cag gag ctg ccc ggg acg gac gac tac gtc ttt tac tgc aaa gac 3 90 
Leu Gin Glu Leu Pro Gly Thr Asp Asp Tyr Val Phe Tyr Cys Lys Asp 
105 110 115 

cag cgc cgt ggg ggc ctg cgc tac atg gga aag ctt gtg ggg ccg tgc 438 
Gin Arg Arg Gly Gly Leu Arg Tyr Met Gly Lys Leu Val Gly Pro Cys 
120 125 130 

cgc tgt ccc cac gtc ggc tea cct ggc cac etc acc tgc agg 480 
Arg Cys Pro His Val Gly Ser Pro Gly His Leu Thr Cys Arg 
135 140 145 

taggaatcct aataccaacc tggaggcect ggaagaattt aagaaattgg tgcagcacaa 540 

gggactctcg gaggaggaca ttttcatgee cctgcagacg ggaagctgcg ttctcgaaca 600 

ctaggcagcc cccgggtctg cacctccaga gcccacccta ccaccagaca cagagcccgg 66 0 

accacctgga cctaccctcc agccatgacc cttccctgct cccacccacc tgactccaaa 720 

taaag 725 

<210> 4 

<211> 146 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 
15 10 15 

Ser Phe Thr Leu Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys 
20 25 " 30 

Ala Met Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys 
35 40 45 

Val Ser Pro Val Lys Val Thr Ala Leu Gly Gly Gly Asn Leu Glu Ala 
50 55 60 

Thr Phe Thr Phe Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu 
65 70 75 80 

Met Arg Lys Thr Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg 
85 90 95 

Lys Leu He Tyr Leu Gin Glu Leu Pro Gly Thr Asp Asp Tyr Val Phe 
100 105 110 

Tyr Cys Lys Asp Gin Arg Arg Gly Gly Leu Arg Tyr Met Gly Lys Leu 
115 120 125 

Val Gly Pro Cys Arg Cys Pro His Val Gly Ser Pro Gly His Leu Thr 
130 135 140 

Cys Arg 
145 



<210> 5 

<211> 741 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43) . . (726) 

<220> 

<223> CDNA396 (741) / SM4 (hOBPIIa-gamma) 
<400> 5 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag atg aag acc ctg 54 

Met Lys Thr Leu 
1 

ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 102 
Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 
5 10 15 20 

gag gag gag gat ate aca ggg acc tgg tac gtg aag gee atg gtg gtc 15 0 
Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys Ala Met Val Val 
25 30 35 

gat aag gac ttt ccg gag gac agg agg ccc agg aag gtg tec cca gtg 198 
Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser Pro Val 
40 45 50 

aag gtg aca gee ctg ggc ggt ggg aac ttg gaa gec acg ttc acc ttc 246 
Lys Val Thr Ala Leu Gly Gly Gly Asn Leu Glu Ala Thr Phe Thr Phe 
55 60 65 

atg agg gag gat egg tgc ate cag aag aaa ate ctg atg egg aag acg 294 
Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu Met Arg Lys Thr 
70 75 80 

gag gag cct ggc aaa ttc age gec tat ggg ggc agg aag etc ata tac 342 
Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg Lys Leu lie Tyr 
85 90 95 100 

ctg cag gag ctg ccc ggg acg gac gac tac gtc ttt tac tgc aaa gac 390 
Leu Gin Glu Leu Pro Gly Thr Asp Asp Tyr Val Phe Tyr Cys Lys Asp 
105 110 115 

cag cgc cgt ggg ggc ctg cgc tac atg gga aag ctt gtg gca tct get 43 8 
Gin Arg Arg Gly Gly Leu Arg Tyr Met Gly Lys Leu Val Ala Ser Ala 
120 125 130 

ccc tgc agg gec gtg ccg ctg tec cca cgt egg etc acc tgg cca cct 486 
Pro Cys Arg Ala Val Pro Leu Ser Pro Arg Arg Leu Thr Trp Pro Pro 
135 140 145 



6 



cac ctg cag gta gga ate eta ata cca acc tgg agg ccc tgg aag aat 534 
His Leu Gin Val Gly lie Leu lie Pro Thr Trp Arg Pro Trp Lys Asn 
150 155 160 

tta aga aat tgg tgc age aca agg gac tct egg agg agg aca ttt tea 582 
Leu Arg Asn Trp Cys Ser Thr Arg Asp Ser Arg Arg Arg Thr Phe Ser 
165 170 _ 175 180 

tgc ccc tgc aga egg gaa get gcg ttc teg aac act agg cag ccc ccg 63 0 
Cys Pro Cys Arg Arg Glu Ala Ala Phe Ser Asn Thr Arg Gin Pro Pro 
185 190 195 

ggt ctg cac etc cag age cca ccc tac cac cag aca cag age ccg gac 678 
Gly Leu His Leu Gin Ser Pro Pro Tyr His Gin Thr Gin Ser Pro Asp 
200 205 210 

cac ctg gac eta ccc tec age cat gac cct tec ctg etc cca ccc acc 726 
His Leu Asp Leu Pro Ser Ser His Asp Pro Ser Leu Leu Pro Pro Thr 
215 220 225 

tgactccaaa taaag 741 

<210> 6 
<211> 228 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 
15 10 15 

Ser Phe Thr Leu Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys 
20 25 ~ 30 

Ala Met Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys 
35 40 45 

Val Ser Pro Val Lys Val Thr Ala Leu Gly Gly Gly Asn Leu Glu Ala 
50 55 60 

Thr Phe Thr Phe Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu 
65 70 75 80 

Met Arg Lys Thr Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg 
85 90 95 

Lys Leu lie Tyr Leu Gin Glu Leu Pro Gly Thr Asp Asp Tyr Val Phe 
100 105 110 

Tyr Cys Lys Asp Gin Arg Arg Gly Gly Leu Arg Tyr Met Gly Lys Leu 
115 120 125 

Val Ala Ser Ala Pro Cys Arg Ala Val Pro Leu Ser Pro Arg Arg Leu 
130 135 140 



Thr Trp Pro Pro His Leu Gin Val Gly lie Leu lie Pro Thr Trp Arg 
145 150 155 160 
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Pro Trp Lys Asn Leu Arg Asn Trp Cys Ser Thr Arg Asp Ser Arg Arg 
1S5 170 175 

Arg Thr Phe Ser Cys Pro Cys Arg Arg Glu Ala Ala Phe Ser Asn Thr 
180 185 190 

Arg Gin Pro Pro Gly Leu His Leu Gin Ser Pro Pro Tyr His Gin Thr 
195 200 205 

Gin Ser Pro Asp His Leu Asp Leu Pro Ser Ser His Asp Pro Ser Leu 

210 215 • 220 

Leu Pro Pro Thr 
225 



<210> 7 

<211> 607 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43) . . (483) 

<220> 

<223> CDNA3 96 (607) - short form (hOBPIIa-delta) 
<400> 7 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag atg aag acc ctg 54 

Met Lys Thr Leu 



ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 102 

Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 

5 10 15 20 

gag gag gag gat gag gga gaa teg gtg cat cca gaa gaa aat cct gat 15 0 

Glu Glu Glu Asp Glu Gly Glu Ser Val His Pro Glu Glu Asn Pro Asp 

25 30 35 

gcg gaa gac gga gga gee tgg caa att cag cgc eta tgg ggg cag gaa 198 

Ala Glu Asp Gly Gly Ala Trp Gin lie Gin Arg Leu Trp Gly Gin Glu 

40 45 50 

get cat ata cct gca gga get gee egg gac gga cga eta cgt ctt tta 246 

Ala His lie Pro Ala Gly Ala Ala Arg Asp Gly Arg Leu Arg Leu Leu 
55 60 65 

ctg caa aga cca gcg ccg tgg ggg cct gcg eta cat ggg aaa get tgt 2 94 

Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Leu His Gly Lys Ala Cys 
70 75 80 



ggc ate tgc tec ctg cag ggc cgt gee get gtc ccc acc ttg get cac 342 
Gly lie Cys Ser Leu Gin Gly Arg Ala Ala Val Pro Thr Leu Ala His 
85 90 95 100 



ctg gcc acc tea cct gca ggt agg aat cct aat acc aac ctg gag gec 390 
Leu Ala Thr Ser Pro Ala Gly Arg Asn Pro Asn Thr Asn Leu Glu Ala 
105 110 115 



ctg gaa gaa ttt aag aaa ttg gtg cag cgc aag gga etc teg gag gag 43 8 
Leu Glu Glu Phe Lys Lys Leu Val Gin Arg Lys Gly Leu Ser Glu Glu 
120 125 130 

gac att ttc atg ccc ctg cag acg gga age tgc gtt etc gaa cac 483 
Asp lie Phe Met Pro Leu Gin Thr Gly Ser Cys Val Leu Glu His 
135 140 145 

taggcagccc ccgggtctgc acctccagag cccaccctac caccagacac agagecegga 543 

ccacctggac ctaccctcca gccatgaccc ttccctgctc ccacccacct gactccaaat 603 

aaag 607 



<210> 8 
<211> 147 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 



Ser Phe Thr Leu Glu Glu Glu Asp Glu Gly Glu Ser Val His Pro Glu 

20 25 30 

Glu Asn Pro Asp Ala Glu Asp Gly Gly Ala Trp Gin lie Gin Arg Leu 

35 40 45 

Trp Gly Gin Glu Ala His lie Pro Ala Gly Ala Ala Arg Asp Gly Arg 

50 55 60 

Leu Arg Leu Leu Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Leu His 



Gly Lys Ala Cys Gly lie Cys Ser Leu Gin Gly Arg Ala Ala Val Pro 
85 90 95 

Thr Leu Ala His Leu Ala Thr Ser Pro Ala Gly Arg Asn Pro Asn Thr 
100 105 110 

Asn Leu Glu Ala Leu Glu Glu Phe Lys Lys Leu Val Gin Arg Lys Gly 
115 120 125 

Leu Ser Glu Glu Asp lie Phe Met Pro Leu Gin Thr Gly Ser Cys Val 
130 135 140 



Leu Glu His 
145 



<210> 9 

<211> 676 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43) . . (552) 

<220> 

<223> CDNA2098 (676) - conventional form (hOBPIIb-alpha) 
<400> 9 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag atg aag acc ctg 

Met Lys Thr Leu 



ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 
Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 



gag gag gag gat ate aca ggg acc tgg tac gtg aag gec atg gtg gtc 
Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys Ala Met Val Val 



gat aag gac ttt ccg gag gac agg agg ccc agg aag gtg tec cca gtg 
Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser Pro Val 



aag gtg aca gee ctg ggc ggt ggg aag ttg gaa gec acg ttc acc ttc 
Lys Val Thr Ala Leu Gly Gly Gly Lys Leu Glu Ala Thr Phe Thr Phe 



atg agg gag gat egg tgc ate cag aag aaa ate ctg atg egg aag acg 
Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu Met Arg Lys Thr 



gag gag cct ggc aaa tac age gec tat ggg ggc agg aag etc atg tac 
Glu Glu Pro Gly Lys Tyr Ser Ala Tyr Gly Gly Arg Lys Leu Met Tyr 
85 90 95 100 

ctg cag gag ctg ccc agg agg gac cac tac ate ttt tac tgc aaa gac 
Leu Gin Glu Leu Pro Arg Arg Asp His Tyr lie Phe Tyr Cys Lys Asp 
105 110 115 

cag cac cat ggg ggc ctg etc cac atg gga aag ctt gtg ggt agg aat 
Gin His His Gly Gly Leu Leu His Met Gly Lys Leu Val Gly Arg Asn 
120 125 130 

tct gat acc aac egg gag gee ctg gaa gaa ttt aag aaa ttg gtg cag 
Ser Asp Thr Asn Arg Glu Ala Leu Glu Glu Phe Lys Lys Leu Val Gin 
135 140 145 



cgc aag gga etc teg gag gag gac att ttc acg ccc ctg cag acg gga 
Arg Lys Gly Leu Ser Glu Glu Asp lie Phe Thr Pro Leu Gin Thr Gly 
150 155 160 



534 
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10 

age tgc gtt ccc gaa cac taggcagccc ccgggtctgc acctccagag 582 
Ser Cys Val Pro Glu His 
165 170 



cccaccctac caccagacac agagecegga ccacctggac ctaccctcca gccatgaccc 642 
ttccctgctc ccacccacct gactccaaat aaag 676 



<210> 10 
<211> 170 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 
15 10 15 

Ser Phe Thr Leu Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys 
20 25 30 

Ala Met Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys 
35 40 45 

Val Ser Pro Val Lys Val Thr Ala Leu Gly Gly Gly Lys Leu Glu Ala 
50 55 60 

Thr Phe Thr Phe Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu 
65 70 75 80 

Met Arg Lys Thr Glu Glu Pro Gly Lys Tyr Ser Ala Tyr Gly Gly Arg 



Lys Leu Met Tyr Leu Gin Glu Leu Pro Arg Arg Asp His Tyr He Phe 
100 105 110 

Tyr Cys Lys Asp Gin His His Gly Gly Leu Leu His Met Gly Lys Leu 
115 120 125 

Val Gly Arg Asn Ser Asp Thr Asn Arg Glu Ala Leu Glu Glu Phe Lys 
130 135 140 

Lys Leu Val Gin Arg Lys Gly Leu Ser Glu Glu Asp He Phe Thr Pro 
145 150 155 160 

Leu Gin Thr Gly Ser Cys Val Pro Glu His 
165 170 



<210> 11 
<211> 782 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (537) 
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<223> CDNA2098 (782) - long form (hOBPIIb-beta) 
<400> 11 

cgc cca gtg acc tgc cga ggt egg cag cac aga get ctg gag atg aag 
Arg Pro Val Thr Cys Arg Gly Arg Gin His Arg Ala Leu Glu Met Lys 



acc ctg ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc 
Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe 



acc ctg gag gag gag gat ate aca ggg acc tgg tac gtg aag gec atg 
Thr Leu Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys Ala Met 



gtg gtc gat aag gac ttt ccg gag gac agg agg ccc agg aag gtg tec 
Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser 



cca gtg aag gtg aca gec ctg ggc ggt ggg aag ttg gaa gec acg ttc 
Pro Val Lys Val Thr Ala Leu Gly Gly Gly Lys Leu Glu Ala Thr Phe 



acc ttc atg agg gag gat egg tgc ate cag aag aaa ate ctg atg egg 
Thr Phe Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu Met Arg 



aag acg gag gag cct ggc aaa tac age gec tgc ttg tec gca gtc gag 
Lys Thr Glu Glu Pro Gly Lys Tyr Ser Ala Cys Leu Ser Ala Val Glu 
100 105 110 

atg gac cag ate acg cct gec etc tgg gag gec eta gec att gac aca 
Met Asp Gin lie Thr Pro Ala Leu Trp Glu Ala Leu Ala lie Asp Thr 
115 120 125 

ttg agg aag ctg agg att ggg aca agg agg cca agg att aga tgg ggg 
Leu Arg Lys Leu Arg He Gly Thr Arg Arg Pro Arg He Arg Trp Gly 
130 135 140 

cag gaa get cat gta cct gca gga get gec cag gag gga cca eta cat 
Gin Glu Ala His Val Pro Ala Gly Ala Ala Gin Glu Gly Pro Leu His 
145 150 155 160 

ctt tta ctg caa aga cca gca cca tgg ggg cct get cca cat ggg aaa 
Leu Leu Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Pro His Gly Lys 
165 170 175 



get tgt ggg taggaattct gataccaacc gggaggcect ggaagaattt 577 
Ala Cys Gly 

aagaaattgg tgcagcgcaa gggactctcg gaggaggaca ttttcacgcc cctgcagacg 637 
ggaagctgcg ttcccgaaca ctaggcagcc cccgggtctg cacctccaga gcccacccta 697 
ccaccagaca cagagcccgg accacctgga cctaccctcc agccatgacc cttccctgct 757 



cccacccacc tgactccaaa taaag 



782 
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<210> 12 
<211> 179 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Arg Pro Val Thr Cys Arg Gly Arg Gin His Arg Ala Leu Glu Met Lys 
15 10 15 

Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe 
20 25 30 

Thr Leu Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys Ala Met 
35 40 45 

Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser 
50 55 60 

Pro Val Lys Val Thr Ala Leu Gly Gly Gly Lys Leu Glu Ala Thr Phe 
65 70 75 80 

Thr Phe Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu Met Arg 



Lys Thr Glu Glu Pro Gly Lys Tyr Ser Ala Cys Leu Ser Ala Val Glu 
100 105 110 

Met Asp Gin lie Thr Pro Ala Leu Trp Glu Ala Leu Ala lie Asp Thr 
115 120 125 

Leu Arg Lys Leu Arg lie Gly Thr Arg Arg Pro Arg lie Arg Trp Gly 
130 135 140 

Gin Glu Ala His Val Pro Ala Gly Ala Ala Gin Glu Gly Pro Leu His 
145 150 155 160 

Leu Leu Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Pro His Gly Lys 
165 170 175 



<210> 13 

<211> 542 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43) . . (297) 

<220> 

<223> CDNA2098 (542) - short form (OBPI lb -gamma) 



<400> 13 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag atg aag acc ctg 54 

Met Lys Thr Leu 
1 

ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 102 
Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 
5 10 15 20 

gag gag gag gat gag gga gga teg gtg cat cca gaa gaa aat cct gat 150 
Glu Glu Glu Asp Glu Gly Gly Ser Val His Pro Glu Glu Asn Pro Asp 
25 30 35 

gcg gaa gac gga gga gec tgg caa att cag cgc eta tgg ggg cag gaa 198 
Ala Glu Asp Gly Gly Ala Trp Gin lie Gin Arg Leu Trp Gly Gin Glu 
40 45 50 

get cat ata cct gca gga get gee cag gag gga cca eta cat ctt tta 246 
Ala His lie Pro Ala Gly Ala Ala Gin Glu Gly Pro Leu His Leu Leu 
55 60 65 

ctg caa aga cca gca cca tgg ggg cct get cca cat ggg aaa get tgt 294 
Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Pro His Gly Lys Ala Cys 
70 75 80 

ggg taggaattct gataccaacc gggaggcect ggaagaattt aagaaattgg 347 
Gly 
85 

tgcagcgcaa gggactctcg gaggaggaca ttttcacgcc cctgcagacg ggaagctgcg 407 

ttcccgaaca ctaggcagcc cccgggtctg cacctccaga gcccacccta ccaccagaca 467 

cagagcccgg accacctgga cctaccctcc agccatgacc cttccctgct cccacccacc 527 

tgactccaaa taaag 542 

<210> 14 

<211> 85 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 
15 10 15 

Ser Phe Thr Leu Glu Glu Glu Asp Glu Gly Gly Ser Val His Pro Glu 
20 25 30 

Glu Asn Pro Asp Ala Glu Asp Gly Gly Ala Trp Gin lie Gin Arg Leu 
35 40 45 

Trp Gly Gin Glu Ala His He Pro Ala Gly Ala Ala Gin Glu Gly Pro 



Leu His Leu Leu Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Pro His 
65 70 75 80 
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Gly Lys Ala Cys Gly 
85 



<210> 15 
<211> 10664 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> hOBPIIa Gene 
<220> 

<221> modif ied_base 
<222> (8507) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (10635) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modif ied_base 
<222> (10643) 

<223> a, t, c, g, other or unknown 



<400> 15 

ctctgcttct atattcggcc tccaaatctg tgctcaatgc agcaactgga gtgacccctt 60 
aaatacgtaa gtcacagctt gcctttgtca gagctatcca gggtctttca ctcagagcag 12 0 
aagctgaagt cctcgtggtg gtccttaatc cctacatggc cgttccaccc actccccagc 180 
ctcatgtgtg gccggtctcc ctggatcatt tgctgtggct gctctgccgt gtgttcccgg 240 
aaactgccag cgttctccca cctctgggct ggcactggat gctcctgcca cctggactcc 300 
tcttccagct gacgagctca tggcttgctt ccttcatgtc ttaaattcgg tgtttgaatg 360 
ccaccttggc gaggctgttc ctcatcaatt catgtaaagg acaaataacc ccttgtctgg 420 
ccactcctgt ccaacttgtc cactttgctt tttccatagc actgatcacc atttaaaata 480 
atgtatgaac cagtgaccca tcagtgacca gagacagtag attgaacagg tattgcttgc 540 
acagtgaaca atcagctcta gaggtttaaa gagaaccaca aagaatggct caggattgca 600 
cgagggcagt taaggaagaa ataaacaggg gtggggatcc tttccacaag tgtgggttca 660 
gaccgcatgg gagaaggtgt ggttctccca agggaagctg gagaagtttg ctgggttgtc 72 0 
ccagccacag ctggcccacg gtcagggcag aggccagcag gcagggaaat gtaggctggg 780 
tctggcaagc aggaagcctc tcttcccccg caaggcaggt gggctggggc tgggaagcct 84 0 
acaagagtca ctgggaaccc acaggtgcag atgccactga atctcaatag gaagccatct 900 
gggggggtcc cctgaattca atgtggtgtg tgccgccaga cgtccccaac ttgtgccact 960 
gccatttata caggaagaga aagaaaaagg aagaaatgga agcatctgga accagtcatc 1020 
ctggacccat gctaggaggt gcttccccct acgcctcaac caacaaagct tagcattgca 1080 
ccaggtgcac aagagaagcg cctgctggct ccagctccat tgcctcggaa ccagccatga 1140 
agggtgcgtg tggagctgga ggcaagacat tgatagctgg cactgcaatt cacttattta 12 00 
ttttgttcat tttaagtccc ctgcacctag aatataagcc ccccaagcac aggacatttg 1260 
ttttattgat cgatgtattc cttgtgcccc aaagaatgag aggcatctag aaagtctgca 1320 
aaaatcaaac ataaaaatga acctttattc agtcattgtt attttgatgg atatttgagg 1380 
catttccaag atttgtaagt aacaattgaa cccttctcct ggttcatgtg tgggagtgta 1440 
tctgttcaaa tgatgcttct gagtggagtt gctgagtctt tggctctagg tttttttttt 1500 
ttaagcattt atgctcattg tggtttttaa attaaacatt taaccctgag acactgtaga 1560 
ttcccatgca attgtaaaaa gccacacaga gctatcatgt gtatcttcac ctggcttgct 1620 
ccagccccaa ccccagcaat gacagacctg ttctccactc ctgcaatctg ctcattgcaa 1680 



gaatgtcgtc tattgcaatc ataaaattgt 
cattctctgg agattcatcc tattgttgca 
gagtagtgct ctatggtatg gatgtaccac 
agtctgtgtt tatagatttg ggctatgaca 
ttcatttccc tgggacaaag gcccaggggt 
gctcattttg ttttgttttg ctttgtttta 
agatccagtt tccatgcatc ctcaccaggc 
ccaccttaat aggtatgtac tgatatatca 
atggtgttga gcatcttttc atgtgtttat 
gtcccttcat gtcttttgtt tacgttctat 
ttataaagta tagaaactaa ttctttgttg 
tctttggctt gtctttttat tctgttaaca 
ttgatgaagt ctattttaac aatttttcct 
aaaacctcct gacccagctc catatgtcaa 
ttttaagttt tatgttgaag tctatgatcc 
gaccaaggtt gtggttcttc tttttctttg 
cagcccattt gctacaaaag ctatctttcc 
aatgtaattg ggtgtatttg tacaggtctg 
atctatgcat ctgtccatgt gctagctata 
tgtggtggcg agagcctgta atcccagcta 
aacccaggag gtggaggttg cagtgagtca 
acagagtgaa ctccatataa aaaaataaaa 
cttattcttt tttaaaaaag ttttagctat 
tagaataatc ttgtctatat ctaccacaat 
acatctttat attatttgag agaaatgata 
gatataacgt ctctccattt gtttagatct 
ttttcagcat acaaatccag catatgggtt 
ttagccatta taaatagtac tgtgttttta 
cacaattgat ctttgtatat ttatcttgtg 
ttctaggagg tgtattgttt tcttttgttt 
agataataat gtcatctgca gatgcagttt 
aatttccttt ttaaaaaacc tatattactc 
caagtggtag agtggacatc cttgccttgt 
tgagtatccc agcctcaaaa tgtgcttaaa 
aaacatactt cgaaactctt tatttctccc 
tcgcttatct aattatgtgc ttacttagaa 
gcagagagac ccagctgcag aatcctccct 
tcccggctga aatcaggatg acgcaaacca 
ataaccatca gaacaagatg cagaccaaca 
caccttttcc tccttaaacc ccttcgctca 
catgggtctg gccattcccc aactgccagt 
acctcacttc tcatgccttg acttctgagc 
atcagtcttt caccattaga aataatgtag 
ttcaagttaa agaagttctc ttctattcct 
tgttgaattt tgttaaatac ttttaacaac 
gagaaacgca ccaggccaca aagcactggg 
tgggttgacc cccagcctcc aaccctacca 
caaccagcat ctaaaagaac acagttggtg 
ggacaatggg aggctcctgg agagcacccg 
cctgttccct gaagcatctg cgcgtgttgt 
agcaccagtg tcatacggtg cttaaatcaa 
tgccggctgg gcgcggtggc tcacgcctgt 
cggatcacga ggtcaggaga tcgagaccat 
ctaaaaatac aaaaaaaatt agccgggcgt 
gggaggctga ggcaggagaa tggcgtgaac 
tcccgccact gcactccagc ctgggcgaca 
aaaaaaaaaa aaaagaaata caaggtgcct 
gaaatcttcc cgttgaggta gagactccag 



gggattggct ttttttttcc tgtgcagcat 1740 
tttatcaata gtttattcca ttttacttct 1800 
agtctgttta accattcacc tgttggagga 1860 
catgtatagg tttttgcatg gacatcagtt 1920 
tctattgctg gattctatgc ttgttacagg 1980 
ttttgttttt aacctgtcaa gccattttcc 2040 
ttcagtatga tcactatgat cttatctcag 2100 
tggcttttat ttgcatttca ctgatgacta 2160 
ttgccatctg tatatcctct ctagtcaagt 2220 
ttttgaaact gttgagtttt gaaaaattct 2280 
aatatgtagt ttgtcaatat tttctttcag 2340 
gggtctctta cagagcaaaa ggtttttatt 2400 
tttatggatc atatttttgg caacaaatct 2460 
agattttctt gttttctaaa agttttatag 2520 
attttgagtt aattttcata cagggtgtga 2580 
gtggttttgt ctgtggatgt ccagttgctc 2640 
tccactgaat tacttttgca tctttgtaaa 2700 
tttgaggatt ctttattgtc tgtcccattg 2760 
taagtcttga aagggtagcc tgtagctggg 2820 
ttcgggaagt ggaggcagga gaatcgattg 28 80 
agatcgtgcc actgcactcc agcctgggtg 2940 
acaaaaataa agtagcgtga tttctcccac 3000 
tccagttcct ttgcctttcc atataaattt 3060 
tctttctgga attttgatag aaattttgtt 3120 
tttttactat gttgagtctt ctaatccatg 3180 
tctttgattt attttataat cattgcattg 3240 
gttagactta tgcctaggca tctcattttt 3300 
agtttagggt ccattactag tatgtagaca 3360 
tcctgcaacc ttgctgaact cacttactag 3420 
tgtttttcaa tttcttggga ttttctacag 3480 
tcttccttcc tttccaattt gtatgtcttt 3540 
tgactagaac tttctgtact atgttaaata 3600 
ccctgatgtt aaagagaaag catttgtaac 3660 
aacttttttc ctttcttgct ttcagccttg 3720 
tttcccacca ggcacttccg tgagcagtgc 3780 
attccagggg ccaattttga aacaaaccag 3840 
cttaggggga gttacaggta gcctaccact 3900 
gacctccgga cagacgattg atgactcaca 3960 
tcctcctgca ccattcccac atatttccca 4020 
gtccagaaaa tctgaatggt cttttaaagg 4080 
atttgaataa agctgctttc cttttaccac 4140 
agcgagcagc tggacttgag ccagttacac 42 00 
ccataggttt ttttttcgta gatgttcttt 4260 
atttttctga gaggtgttat cccgaatgag 4320 
ccaacaggaa ccaccatcag aagccatcca 4380 
ggccagggat cttgcccctg ctgtctgcca 4440 
tcccctgacg gtgtctgcag cagttgaacc 4500 
aacgagactg ggacacaggg caagatgggt 4560 
taccagcgag catagaattc atgggggtga 4620 
tccagcattt tcttcaagga ttgagccagc 4680 
tgattcacag ccaaccaatg aaatacaagg 4740 
aatcccagca ctttgggagg ccgaggcggg 4 8 00 
cccggctaaa acggtgaaac cccgtctcta 4860 
agtggcgggc gcctgtagtc ccagctactt 4920 
ccgggaggcg gagcttgcag tgagccaaga 4980 
gagcgagact ccgtctcaaa aaaaaaaaaa 5040 
ggggtacagg caacaaaaat ggggaacgga 5100 
cctgtttgct catctctggt cctgaagagc 5160 
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ccagtccccg ccctaaggat ggggtttctg ctcggcagca cttgccgtga gaggggtgag 5220 
gcactgggtc acgccagccc tttctttata gcccccaggg tttattagag tgtgcattag 5280 
tattatttag caagcacttg agggtgtctg atcttgggcc agagaggcac aaagattgtg 5340 
tgggcccagc acctgcctgc agagcgtggg tcagccttgg gtcccagggc agatgacgca 54 00 
ggcccgggaa ctacagggtc caggagggag aaggcaaagt tctggctcag gttggctggg 5460 
gatgaggcca gcggagccag gtgcccaagg gagctcagcc acaaactctg agcacaggct 5520 
ggcaggtggc tcttgatgct catgccaccc atttatctaa agggatgaga ttcaaggcct 5580 
gtcctggtgg ctgggggccg tcgaagctga cgagagaggg gatgtagagt gaatgtatat 564 0 
tccactctac cacttgtatt taacagggag atggatgatg aacacgtgca ggaggaaaca 5700 
ggcaggacaa tccagagaga tcacgtgttc tgaggacagc acagccaggc tccggtacgg 5760 
agtgaagcgg ggtggggcag gcggcggggt ccctcatatg gcccgaggag gccgtatata 5820 
tactgacctt gagccacaca atagtgccct tctctgcccc taggaagctc gagtgcaggg 5880 
tccaggtggg gaaaatcaat gcagagtggg tcccagagtg ggcggaagct tgggctctag 5940 
ggcgtgcggg actcagctgg cagcagcccc acatctctat ggttctaaag cccagtccat 6000 
ttctgctcag cagggagatg gccagttccc cagaggactt gcccagggtc ccagccgtgg 6060 
ccctgggagg gctcaggggt gagggaggga gaattccgag ccgtgggccc tccctctgtg 6120 
gctcggggtg cagacgtgca ctaccctgcc ctgtcctctg gagtcctccg ggctcatcca 6180 
agtgggcgca tttgggtgca aaccctggac agcatcgtgt gtcttttctt ctctggctgg 6240 
cactgaattg ctgttcacca gatgccagca atgaggacac tccccctccc ggatgcagga 6300 
ctggttgcca ccatggggag gggtgcaccg tgccacgtgc tcccaggaca ctggccaggg 6360 
ggctataaag aacatctcga gaggagccag cacagccttg ttcagacgcc cagtgacctg 6420 
ccgaggtcgg cagcacagag ctctggagat gaagaccctg ttcctgggtg tcacgctcgg 6480 
cctggccgct gccctgtcct tcaccctgga ggaggaggat gtgagctggg ttggcgtggg 6 54 0 
cggatggagg agccaggtgg actcctgggc agggggcagt gccaggggcc ctgctttagg 6600 
aggtgtcact taagcctggg gtggtggtgg aggggtccta ttgttccttc agcagacaat 6660 
gctccccatg aggcccaggg ccggagcagg ctcggctcag gggttcctgc tgcactgacg 6720 
cctgaagccc gaaggtctcg cagggttggg ccctgtggag ggagggctca cctggtgctg 6780 
gggcccgggg gtccatgggg tgcagacatg ccctccttcc actgggggct gggagccctg 6840 
agcagggggc tggctctaac tcactccagc tgagctctaa ctaaggtgca ggaacccagc 6900 
ctgcctttag gggtggcagc cgggcaccat gggtgtctgg ttatagctgc aggcctgagt 6960 
gccagggtca gagtagaatc tgggccaccc atggtgggct cacggccttg gcctgctcca 7020 
gatcacaggg acctggtacg tgaaggccat ggtggtcgat aaggactttc cggaggacag 7080 
gaggcccagg aaggtgtccc cagtgaaggt gacagccctg ggcggtggga acttggaagc 714 0 
cacgttcacc ttcatgtgag tgttgcccac tgcagggccc ctcaggccac tttcgctccc 7200 
cgccccagac ccacctggtg cccattgccc catccacatt tcgggtgttg ggaagagtca 7260 
ccccctgcct tggagggaaa cagccagggc atcctgaagc tcggtggggt gggggggcag 7320 
tggaattttc aggttgccgg gtcagggcca tgcaccaggt gagctgagga tgggccaggt 7380 
gtgtcctggg agccgctgcc cgcgtgtctc ctgttttcca ggagggagga tcggtgcatc 7440 
cagaagaaaa tcctgatgcg gaagacggag gagcctggca aatacagcgc ctgtgagccc 75 00 
ctccccgacc ccccactccc catgcccaac cccggatgca ccagccccac tgcaggtgga 7560 
gagtgcccag gccacacttc tgccagggtc ccagccctgc ccacctccaa ggaggggctg 7620 
gcctctcctt cctggggggc tggtggccct gacatcagac accaggtgtg acaggcttgt 7680 
ccacagtaga gatggaccag atcaagcctg ccctctggga ggccctagcc attgacacat 7740 
tgaggaatcc gagtgttagg gaccaggagg ccgagggtta gggatgggaa gccaaggctg 78 00 
agggtttggg atcaggaatc cgagggttag ggacagggaa gtggggcagg agcagctgct 7860 
ggagctggga aggccggact ctagtcctgg acgtgctctg gccttgtggc tccattactt 7920 
gcattgggac cttccgagag gaggctcctg cctccgtgtc cgggtccatg ctgtgcggag 7980 
cagccaggcc tggctcaggc tgtccagggc acctgggtga ccactgaaac attcctgagt 8040 
gtttcttcgt gtggtcctga gtgctctctc cgggaatgag ggcactgaag acccatcttc 8100 
tctgtcatct acagatgggg gcaggaagct catatacctg caggagctgc ccgggacgga 8160 
cgactacgtc ttttactgca aagaccagcg ccgtgggggc ctgcgctaca tgggaaagct 8220 
tgtgggtgag gggcccgctg gggcctgcat gtcctgcccc atggtctctg cctccagaag 8280 
ccagtggaac caccatcatc acgccctggc acggggggaa aaggaagccc cctgcgccgg 8340 
ccttcgtgtg ctaggcacca agcgctgccc tggatggctg gtccaagttc ctgaagtggg 84 00 
agtggggtgg gccaggcagg gacagacacg gccctcggtg acgtgaacct gccaagggcc 84 60 
gcttgtgggg tctcaggtgt aggggcctca ccttaagggg gaggtancat cttaacagag 8520 
ctcttcatgg ggcagggact ctccagggcg gcagggcagc cagtgcctct gggacacaag 8580 
gtccctccag gtgagggttg tgaccctgca gagtggcttt gggagctgcc caggtccccc 8640 
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tggggttgct gagtggcttg gaccctgcca 
ggcggtggcc gtctcctctg tccccagtcc 
tcctctcccc cttgcctggt gctggacagt 
tgagctctga tccactctcg ggcctctccc 
ttttagcatc tgctccctgc agggccgtgc 
ctcacctgca ggtaggaatc ctaataccaa 
ggtgcagcac aagggactct cggaggagga 
cggctgtgcc cagtaccccg tgttcccctg 
ccctcgtgtc ctcccatgtc ccccgcattc 
tctgggccct gcttagcatc ctcgtcgttg 
ctggggctcc gcgctctggg ctgcgatgcg 
gggtctgggg ctccgcgctc tgggctgcga 
gatggggtct ggggctccgc gctctgggct 
ctgcgatggg gtctggggct ccgcgctctg 
tgggctgcga tggggtctgg ggctccgcgc 
gctctgggct gcgatggggt ctgggccctg 
tcttggtctc tgcaggaagc tgcgttctcg 
cactggacaa gcccagagtc ctgggttccc 
tcccatccca cacagccagg gagacccccc 
gtctgcccac ccaaaattcc tgggacattc 
ctgccagccc gagtgagggt cctcctcggc 
ctcaggtgtc agctcggccc atcgccccct 
cacccactca ccaaggatgc tcagaggcct 
tggagccccg aggctgccca gcagtggccg 
ctctgggcct ggcctctgcc ctaccctgca 
acagcagccc ccgggtctgc acctccagag 
ccacctggac ctaccctcca gccatgaccc 
aaagagcttc tcccccagct ctgggcaggc 
gctccctggg aggcctgctg ggagggggag 
tagcaagggc caaaccacaa gaggcacgat 
ctggaggggc ctccggaaga gccgccctcg 
ctctgggcag gagcacttgg gaagccactg 
cctgcatcag tgggtccacc acagggctcc 
gcgggtccac tttancatcc tgngctcagt 
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ctgtcccctt tcctggggac ctctcacctg 8700 
cacccctgag ctcttgtcca ttctcaggcc 8760 
tgccatctct tctgtcccca gccccacccc 8820 
ccgtcctgat gctgggccgt ggtcgtctcc 8880 
cgctgtcccc acgtcggctc acctggccac 8940 
cctggaggcc ctggaagaat ttaagaaatt 9000 
cattttcacg cccctgcaga cgggtgagga 9060 
tgtctctgtg tgatctccag tgtcccatga 9120 
cccatgtgcc ccgagtctcc tcgcaggggc 9180 
gagggtctgc actctgggct gcgatggggt 9240 
gtctggggct ccgcgctctg ggctgcgatg 93 00 
tggggtctgg ggctccgcgc tctgggctgc 93 6 0 
gcgatggggt ctggggctcc gcgctctggg 9420 
ggctgcgatg gggtctgggg ctccgcgctc 9480 
tctgggctgc gatggggtct ggggctctga 9540 
gtctagggcg ccttctaatc cctgggtttt 9600 
aacactaggg tgagtgagcc tttaggaggg 9660 
ggggttcgag ggtacatctg ctctggccct 9720 
cagggtcagg cacgaggttg gcacctcaga 9780 
gggaagtcct ttgttttacc attcctgcac 9840 
ctttccacag cgaggcctcc ctccggctcc 9900 
gcacctgtcc ccactcggtc tactcccccc 9960 
gcccagtcat tggaggagct gaggctgctc 10020 
atgtggaagc tgcagagcct ggggagggag 10080 
gcctccctga cctctgctcc tcttcccagc 10140 
cccaccctac caccagacac agagcccgga 10200 
ttccctgctc ccacccacct gactccaaat 10260 
ctatctgtgg ggacggaggg gctcgcaccg 10320 
ccacaaggga agctggggag atgttggacc 10380 
gtccaacagg ctgtggggct gcgtgatgtc 10440 
gatctcaggc tcagcttggg acgggcaggg 10500 
ggagggccgg agtggggaca cggcgtcaga 10560 
cgccccgggg gtctcagtgg gatcctccga 10620 
ggcatcactg gata 10664 
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tctctgcttc tatattcggc ctccaaatct gtgctcaatg cagcaactgg agtgacccct 60 
taaatacgta agtcacagct tgcctttgtc agagctctcc agggtctttc actcagagca 120 
gaagctgaag tcctcgtggt ggtccttaat ccctacatgg cagttccacc cactccccag 180 
cctcatgtgt ggccggtctc cctggatcat ttgctgtggc tgctctgctg tgtgttcccg 240 
gaaactgcca gcattctccc acctccaggc tggcactgga tgctcctgcc acctggactc 300 
ctcttccagc tgacgagctc atggcttgct tccttcatgt cttaaattcg gtgtttgaat 360 
gccaccttgg cggggctgtt cctcatcaat tcatttaaag gacaaataac cccttgtcct 420 
gacactcctg tccaacttgt ccactttgct ttttccatag cactgatcac catttaaaat 480 
aacgtatgaa ccagtgaccc atcaggatcc agagacaata gattgaacag ggattgcttg 540 
cacaatgaac aatcagctct agaggtttaa agagaaccac aaagaatggc tcaggattgc 600 
acgagggcag ttaaggaaga aataaacagg ggtggggatc ctttccacaa gtctgggttc 660 
agacctcatg ggagaaggtg tggttctccc aagggaagct ggagaagttt gctgggttgt 720 
cccagccaca gctggcccac ggtcagggca gaggccagca ggcagggaaa tgtaggctgg 780 



gtctggcaag caggaagcct ctctccccac 
tgcaagagtc actgagaacc cacaggtgca 
tSSgggcggt cccctgaatt caatgtggtg 
ctgccatttg cataggaaga gaaagaaaaa 
tcctggaccc atgctaggag gcgcttctcc 
caccaggtgc acaagagaag cgcctgcttg 
gaagggtacg tgtggagctg gaggcaagac 
tattgtgttc attttaagtc ccctgcacct 
tgtttcattg atcgatgtat tccttgtgcc 
caaaaatcaa acataaaaat gaacctttat 
ggcatttcca agatttgtaa gtaacaattg 
tatctgttga aatgatgctt ctgagtggag 
ttttaagcat ttatgcttat tgtggttttt 
gattcccatg cagttgtaaa aagccacaca 
ctccagcccc aaccccagca atgacagacc 
aagaatgtcg tctattgcaa tcataaaatt 
catcattctc tggagattca tcccattgtt 
tctgagtagt gctctatggt atggatgtac 
ggatgtctgt gtttatagat ttgggctatg 
gttttcattt ttctgggaca aaggcccagg 
agggtttttt tttttaaacc tgtcaagcca 
cctcaccagg cttcagtatg atcactatga 
ctgatatatc atggctttta tttgcatttc 
catgtgttta tttgccatct gtatatcctc 
ttacattcta tttttgaaac tgttgagttt 
attctttgtt gaatatgtag tttgtcaata 
ttctgttaac agggtctctt acagagcaaa 
caatttttcc ttttatggat catatttttg 
ccatatccca aagattttct ttctgttttc 
taagtctatg atccattttg agttaatttt 
cttctttttg tttggtggtt ttgtctgtgg 
aaagctatct tccattgaat tgcttttgca 
tgcaggtctg tttgaggatt ctttattgtc 
gctagctata taatgagtct tgaaagggta 
gtaatcccag ctacttggga ggcggaggca 
ttgcaatgag tcgagattgt gctactgcac 
ctctaaaaaa aagaaagaag aagaaaagga 
attcttaaaa aaaaaagttt tagttattct 
aataatcttg tctatgtcta caaaaattcc 
tctttatatt atttgagaga aatgacgctt 
atagcacatc tctccatttg tttagatctt 
tttcagcata caaatccagc atatgggttg 
tagccactat aagtagtatt gtgtttttaa 
acaattgatc tttgtatatt tatcttgtat 
tctaggaggt gtattgtttt cttttgtttt 
gacaatcatg tcatctgcag atgcagacag 
tttaatttcc tttttaaaaa acctttattg 
atacaagtgg tgaaagtgga catccttgcc 
taactgagta tcccagcctc aaaatgtgct 
cttgaaacat acttcgaaac tctttatttc 
gtgctcgctt atctaattat gtgcttactt 
ccaggcagag agacccagct gcagaatcct 
cacttcccgg ctgaaatcag gatgactcaa 
catgataacc attggaacaa gatgcagacc 
cccacacctt ttcctcctta aaccccttcg 
aaggcatggg gcctggccat tcctctactg 
accacacttc acttctcatg ccttgacttc 
tacacatcag tctttcacca tgagaaatga 



ccaaggcagg tgggctgggg ctgggaagcc 84 0 
gatgccactg aatctcaata ggaagccatc 900 
tgtgccgcca gacgtccccg acttgtgcca 960 
ggaagaaatg gaagcatctg gaaccagtca 1020 
ctacgcctca accaacaaag cttagcattg 1080 
ctccagctcc attgcctcgg aaccggccat 1140 
attgatagct ggcactgcaa ttcacttatt 1200 
agaatataag ccccccgagc acaggacatt 1260 
ccaaagaatg agaggcatct agaaagtctg 1320 
tcagtcattg ttattttgat ggaaatttga 1380 
aacccttctg ctggttcatg tgtgggagtg 1440 
ttgctgagtc tttggctcta ggtttttttt 1500 
aaattaaaca tttaaccctg agacattgta 1560 
gagctatcat gtgtatcttc acctggcttg 1620 
tgttctccac tcctgaaatc tgctcattgc 1680 
gtgggattgg cttttttttt tcctgtgcag 1740 
gcatttatca atagtttatt ccattttact 1800 
cacagtctgt ttaactattc acctgttgga 1860 
acacatgtac aggtttttgc atggacatca 192 0 
ggttctattg cggggttcta tgcttgttgc 1980 
ttttccagaa actgtccagt ttccatgcat 2040 
tcttatctca gccaccttaa taggtatgta 2100 
actgatgact aatggtgttg agcatctttt 2160 
tctagtcaag tgtcccttca tgtcttttgt 2220 
tgaaaaattc tttataaagt atagaaacta 2280 
ttttctttca gtctttggct tgtcttttta 2340 
aggtttttat tttgatgaag tctattttaa 2400 
gtgacacatc taaaatctcc tgacccagct 2460 
taaagctttt atagttttaa gttttatgtt 2520 
cttgcagggt gtgagaccaa ggttgtggtt 2580 
atgtccagtt aatccagccc atttgctaca 2640 
tctttgtaaa aacttaactg ggtatatgcg 2700 
tgtcccattg atctatgcat ctgtccatct 2760 
gcctgtagct gggtatggtg gtgtgcatct 2820 
ggagaatcac ttgaatctgg gaggtggagg 2880 
tccagcctgg gtgacagagc aagactctgt 2940 
aagaaagggt agcctgattc ctcccacttt 3000 
agttcctttg cctttccata taaatgttag 3060 
ttctggaatt ttgatagaaa ttgtgttaaa 3120 
ttattatgtt gagtctccta atccatggat 3180 
ctttgattta ttttataatc attgcattat 3240 
ttaaacttat gcctaggtat ctcttttttt 3300 
gtttagggtc cgttactagt atgtagacac 3360 
cctgcaacct tgctgaactc gcttaatagt 3420 
gtttttcagt ttcttgggat tttctacaga 3480 
ttttcttcct tcctttccaa tttgtatgtc 3540 
ctctgactag aactttctgt actatgttaa 3600 
ttgtccctga tgttaaggag aaagcgtttg 3660 
taaaaacttt tttcctttct tgctttcagc 3720 
tccctttccc accaggcact tctgtgagca 3780 
agaaattcca ggggccaatt ttgaaacaaa 3840 
ccctcttagg gggagttaca ggtagcctac 3900 
accagacctc tggacagacg attaatgact 3960 
aacatcctcc tgcaccattc ccacatattt 4020 
ctcagtccag aaagtttgaa tggtctttta 4080 
ctagcatttg aataatgctg ctttcctttc 4140 
tgagcagcga gcagctggac ttgagccagt 4200 
tgttagccat aggttttttg tagatgctct 4260 
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ttgtcaagtt aaggaggttc tcttctattc 
cgtgatgaat ttagtcaagt actttttctg 
tcttaacata ttaatatggt tgatttttga 
aactgctttt ggtcatggtg tagaattaat 
aaaggatttc tgtatctata ttcatgaggt 
gtcaggtttt ggttgggaga tattctctca 
ttatttttta aatggttggt agaattttct 
tttggaagct tttaaaatta aaatgttaac 
tatcttatat tggcattgtg ttttttgaga 
atatgtggga aggtctcttc gtagtattcc 
tagtgatagt ctctgttttg ttccagatgt 
tttgtcagtc tttctagaac tgcctcaatt 
tttcattgtt tttctctgtt gtttttctgc 
ttattaattg ctgctcttat ctttgttatt 
tcttcttttt ctaggttctt gaagtgggag 
ctaatatatg catttaatgc ctctcagcac 
tgtttttgtt tgtttgtttt tgaggtagag 
tggcactatc ttggctcact gcaacctcca 
agccccctaa gtagctggga ttacaggtat 
atttattttt gtatttttag tagagatgag 
actcctgacc tcaagtgatc tgcccacgtg 
agccactgtg cccagcctgt ttttgttttt 
gctggagtgc agtggcatga tagcccactg 
tcctgcctca gcctcccgat tagctgggat 
ttttgcattt ttggtagaga tgggttttcg 
gatcgcaagt gatccacccg cctgggcctc 
tcacgcccag ctgatatgtt gtattttcat 
accttcaggc ttcctctttg acctgtggat 
gaagaaagat tttccattat ctttctggca 
agactaccct ctatatcatt taaattttta 
ggatatggtc tatcttggta tatattctgg 
tgccaggtgg tgtgttctac aaatgtcaat 
tctttgatgt ccctgctggt tttctgtcta 
caactataat tgtggctttt taaatttctc 
gcagctaatg tttggtgcac acacgtttag 
acttattatt gtataatttc cctctctgtt 
ctgttattga aataatgctt ctgctttctt 
tttttacttt caacctgtct ctattgttac 
tagttgtata atggctttta attcactttg 
ttacatttaa tataattatt aacatgctag 
acaaaatact accaactggg tggcttagaa 
cagaagtctg agatcaagtg agatttggaa 
tgttaccatt gtgggcaact gggactcagt 
gaaccatatt tgattgttcc acttagaagg 
ctatccctac agactgagga tgctgattca 
ggaagcacag aagaaaaggt gcaggcactt 
gggctctagg gcgtgcggga ctcagctggc 
ccagtccatt tctgctcagc agggagatgg 
cagccgtggc cctgggaggg tgcaggggtt 
tgcctctgtg gcttggggtg cagatggaca 
ggctcatctg agtgggcgcg ttcgggtggt 
cttctctggc tggcactgac ttgctgttca 
cccggatgca ggactggttg ccaccatggg 
acactggcca gggggctafca aagaacatct 
gcccagtgac ctgccgaggt cggcagcaca 
gtgtcacgct cggcctggcc gctgccctgt 
gggttggcgt gggcggatgg aggagccagg 
gccctgcttt aggaggtgtc actgaaggct 



ctacttttct gagagttgtt ttcctgaatg 4320 
cattgatcga tatgatcatg taatttttct 4380 
gtattgtacc aggtttccat ccctggaata 4440 
tttatatatt attatttgct aatattttgt 4500 
atattgttct gcagctttcc tttttgtgtt 4560 
tcttctgaga ttatgcagag ttggtgttat 4620 
aatgaaacca tatggacatg aagaattatt 4680 
tttaattgtc atagagctat taaagttatc 4740 
aattggttca tttcacctat gttgtcaaat 4800 
cttattatcc tttgatgtct gcagggtctg 4860 
tggttatttg tgtctttctt tctttttttg 4920 
ttattaattt ttccagagaa ctaacccttg 4980 
tttcgatttt ttctgctttc tgcttttgat 5040 
tcttttcttc ttgctttggg tttattttgc 5100 
cttagatagt tgattttgga cttctctttt 5160 
tgatttagct gtgtctcaca aattttgata 5220 
tctcgctctg tatcccaggc tggagtgcag 5280 
cctcctgggt tgaagtaatt ctcctgcttc 5340 
acgccaccat gcccagcaat ttatttattt 5400 
gttttgtcat gctgtccagg ctggtctcga 5460 
gggctccaaa gcactgagat tataggcata 5520 
tttgagatgg agtcttgctc tgtcacccag 5580 
caacctccgc ctcccgggtt caagtgattc 5640 
tacaggcgtg caccaccaca ctcagctagt 5700 
ccatgttggc caggctggtc tcgaattcct 5760 
ccaaagtgct ggaattatag gcacaagcca 5820 
tttcatccag cctagtatat ttttaaattt 5880 
tattcacagg tgtgttgttt ccaagcatct 5940 
ttgatttata gttagattcc atgtggtcag 6000 
aaaatttctt gaggctcatt ttctcatcca 6060 
tgcacttgaa aagaatgtgt gtagtgctgt 6120 
tcgattatgt taattgatgg gtggccgagt 6180 
gttgctctga gagaagggta ttgaagtctc 6240 
ctttcagttc tgttttgctt cacatattgt 6300 
gattactatg tcttctttgc agaatggccc 6360 
aattagtctt tgttctaaag tctattttac 6420 
ttgattaatg tttatatatc ttttttcatg 6480 
atttgaagta agtttcttgt agacggcata 6540 
ccaatcactg tcatttaata tttaaaccat 6600 
agtttgtatt atttttatag ggttgctata 6660 
caaaaattta ttttctcaca ttctggtggc 6720 
ggtagaaaaa ccaacaaaat atgttaatga 6780 
ccttctgagg gccatctgag aaatagggtg 6840 
aagagtccag accatttatc ccatcagtta 6900 
cttgcacatc tgggtttcat cagtgtcagg 6960 
aaggtgggaa gctgtgaata cgtccgtgca 7020 
agcagcccca cgtctctctg gttctaaagc 7080 
ccagttcccc agaggacttg cccagggtcc 7140 
ggtgagggag tatcccgagg ctgtgggccc 7200 
ccaccctgcc ctgtcctctc gagtcctctg 7260 
gcaaactctg gatggcatcg tgtgtctttt 7320 
ccagatgcca gcaatgagga cactccccct 7380 
gaggggtgca ccgtgccacg tgctcccagg 7440 
cgagaggagc cagcacagcc ttgttcagac 7500 
gagctctgga gatgaagacc ctgttcctgg 7560 
ccttcaccct ggaggaggag gatgtgagct 7620 
cggactcctg ggcagggggc agtgccaggg 7680 
ggcttctgac cccgtgctcc cagcctgggg 7740 
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tggtggtgga ggggtcctat tgttcctcca 
gagcaggctc gtctcagggg ttcctgctgc 
ggttgggccc tgtggaggga gggctcacct 
agacatgccc tccttccact gggggctggg 
ctccagctga gctctaacta aggtgcagga 
gcaccatggg tgtctggtta tagctgcagg 
gccacccatg gtgggctcac ggccttggcc 
aggccatggt ggtcgataag gactttccgg 
tgaaggtgac agccctgggc ggtgggaagt 
tgcccactgc agggcccctc aggccacttt 
attgccccat ccacgtttcg ggtgttggga 
cctgggcatc ctgaagctcg gtggggtggg 
gggccatgca ccaggtgagc tgaggatggg 
tgtctcctgt tttccaggag ggaggatcgg 
acggaggagc ctggcaaata cagcgcctgt 
cccccaccgc caaccccagt gcaccagcct 
ttgccagggc cccagctctg cccacctcca 
ctggtggccc tgacatcaga caccgggtgt 
gatcacgcct gccctctggg aggccctagc 
gacaaggagg ccaaggatta ggtacgggga 
gttaaggatg gggaagctga gggttaagga 
aagggttagg gatcgggaag ctgagggtta 
gaggccgagg gttagtgctg cggaagctga 
atgagggtta gggacaggga agtggggcag 
tctagtcctg gacgtgctct ggccttgtgg 
aggaggctcc tgcctccgtg tccgggtcca 
gctgtccagg gcacctgggt gaccgctgaa 
gagtgttctc tctgggaatg acccattttc 
catgtacctg caggagctgc ccaggaggga 
ccatgggggc ctgctccaca tgggaaagct 
gtcctgcccc acggtctctg cctctggaag 
a tgggggaaa aggaagcccc ctgtgccggc 
gtccaagtac ctgaagtggg aatgggatgg 
acatgaacct gccaagggcc acttctgggg 
gagggtcacc ttaagcgtcc taagagagct 
agggtggccg gtgcctctgg gagacaaggt 
gtggcattgg gagctgccca ggtcccctgg 
actgtccccc ttcctgggga cctctcatct 
ccacccctga gctctcatcc attctcaagt 
tggccatctc ttctgtcccc ggccccaccc 
ccttgtcctg gtgctgggca gtggccatct 
gatccactct caggcctctc cccctgtcct 
atctgctccc tgcagggccg tgccgctgtc 
gcaggtagga attctgatac caaccgggag 
cgcaagggac tctcggagga ggacattttc 
gcccagtccc ctgtgtccct ctgctgtgtc 
catgtcctcc catgtccccc gcattcccca 
gggccctgtt tagcgtcctc ctcattggag 
gggctccgcg ctctgggctg cgatggggtc 
tctggggctc cgcgctctgg gctgcgatgg 
ggggtctggg ccctggtcta gggcgccttc 
gaagctgcgt tcccgaacac tagggtgagt 
gagtcctggg ttcccggggt tcgagggtac 
ccagggagac ccccccaggg tcaggcacga 
gttcctggga cattcgggaa gtcctttgtt 
agggtcctcc tcggcctttc cacagcgagg 
agcccatcgc cccctgcacc tgtccccacg 
gatgctcaga ggcctgccca gtcattggag 
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gcagacacgg ccccccgagg cccagggccg 7800 

actgacgcct gaagcccgaa ggtctcgcag 7860 

ggtgctgggg cccgggggtc catggggtgc 7920 

agccctgagc agggggctgg ctctaactca 7980 

acacagcctg cctttaaggg cagcagccgg 8040 

cctgagtgcc agggtcagag tagaatctgg 8100 

tgctccagat cacagggacc tggtacgtga 8160 

aggacaggag gcccaggaag gtgtccccag 8220 

tggaagccac gttcaccttc atgtgagtgt 8280 

cgctcccctc cccagaccca cctggtgccc 8340 

agagtcaccc cctgccttgg agggaaacag 8400 

ggacagtgga attttcaggt tgccgggtca 8460 

ccatgggtgt cctgggagcc gctgcccgcg 8520 

tgcatccaga agaaaatcct gatgcggaag 8580 

gagcccctcc cccactccca cccccaccct 8640 

ccacaggtag agagtgccca ggctgccctt 8700 

aggaggggct ggcctctcct tcctgggggg 8760 

gacaggcttg tccgcagtcg agatggacca 8820 

cattgacaca ttgaggaagc tgaggattgg 8880 

ggctgagggt tagggatggg gaagctgagg 8940 

tggggaggct gagggttagg gacgggggcc 9000 

gggatgagga ggccgagggt tagggatggg 9060 

ggattagaga tggggaggct gagggggaag 9120 

gagcagctgc tggagctggg aaggccggac 9180 

ctccattact tgcattggga ccttcctgag 9240 

cactgtgcgg agcagccagg cctggctcag 93 00 

acattcccga gtgtttcttc atgtggcccc 9360 

tctgtcatct acagatgggg gcaggaagct 9420 

ccactacatc ttttactgca aagaccagca 9480 

tgtgggtgag gggcttgctg gggtctgcgt 9540 

ccggtggaac cagcaccacc atgccctggc 9600 

tttcatgtgc cgggcaccgc tgggcagccg 9660 

gccaggcagg gacagacatg gccctcagtg 9720 

tctcgggtgt aggagggtca ccttaagggg 9780 

cttcatgggg cagggaccct ccagggcagg 9840 

ccctccaggt gagggctgtg accctgcagg 9900 

ggttgccgag tggcttggag agtccccgcc 9960 

gggcagtgac tgtctcctct gtccccagtc 10020 

ctcctctcct gcttgtccgg tgctgggcta 10080 

cagagctctg gtccactctt gggtctctcc 10140 

cctctgtccc cagccccacc cccgagctct 10200 

ggtgctgggc cgtggtcgtc tcctttcggc 10260 

cccacgtcgg ctcacctggc cacctcacct 10320 

gccctggaag aatttaagaa attggtgcag 10380 

acgcccctgc agacgggtga ggatggctgt 10440 

tgtctgctat ctccagtgtc ccatgacccc 10500 

tgtgccccga gtctcctcgc aggggctccc 10560 

gctctgtgct ctgggctgcg atggggtctg 10620 

tggggctccg cactctgggc tgcgatgggg 10680 

gctctggggc tctgagctct gggctgtgat 10740 

taatccctgg gtttttcttg gtctctgcag 10800 

gagcctttag gagggcactg gacaagccca 10860 

atctgctctg gcccctccca tcccacacag 10920 

ggttggcacc tcagagtctg cccacccaca 10980 

ttatcattcc tgcacctgcc agcccgagtg 11040 

cctccctccg gctccctcag gtgtcagctc 11100 

cggtccactc cccaccatcc acccaccaag 11160 

gagctgaggt cgctctggag ccccgaggtt 11220 
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gcccagcagt ggccgatgtg ggggctgcag 
ctgccctacc ctgcagcctc cctgacctct 
tctgcacctc cagagcccac cctaccacca 
ctccagccat gacccttccc tgctcccacc 
cagctctggg caggcctatc tgtggggaca 
tgctgggagg gggagccaca agggaagctg 
cacaagaggc acgatgtcca acaggcctga 
ggaagagccg ccctcaggtc tcaggttcag 
gtttggaaag ccactgggga ggacaggagc 
ggccaccgca gggctcccgc ctcaggggtc 
agcagcctgg gctcagtggc agcactggag 



agcctgggga gggagctctg gccctggcct 11280 
gctcctcttc ccagcacagc agcccccggg 11340 
gacacagagc ccggaccacc tggacctacc 11400 
cacctgactc caaataaagt ccttctcccc 11460 
gaggggcttg cacccgctcc ctgggaggtc 11520 
gggagatgtt ggacctagca agggccaaac 11580 
ggggctgcgt gatgtcctgg aggggcctcc 11640 
cttaggacag gcagggccct gggcaggagc 11700 
ggggacacgg cgtcagggct gcagcagtgg 11760 
tcagagggat cctccaagct cccccatttt 11820 
a 11851 



<210> 17 

<211> 253 

<212> DNA 

<213> Homo sapiens 

<400> 17 

cactccccct cccggatgca ggactggttg 
tgctcccagg acactggcca gggggctata 
ttgttcagac gcccagtgac ctgccgaggt 
ctgttcctgg gtgtcacgct cggcctggcc 
gatgtgagct ggg 



ccaccatggg gaggggtgca ccgtgccacg 6 0 
aagaacatct cgagaggagc cagcacagcc 12 0 
cggcagcaca gagctctgga gatgaagacc 18 0 
gctgccctgt ccttcaccct ggaggaggag 24 0 
253 



<210> 18 
<211> 154 
<212> DNA 
<213> Homo 



sapiens 



<400> 18 

cctgctccag atcacaggga cctggtacgt 
ggaggacagg aggcccagga aggtgtcccc 
cttggaagcc acgttcacct tcatgtgagt 



gaaggccatg gtggtcgata aggactttcc 60 
agtgaaggtg acagccctgg gcggtgggaa 12 0 
gttg 154 



<210> 19 

<211> 91 

<212> DNA 

<213> Homo sapiens 

<400> 19 

tgttttccag gagggaggat cggtgcatcc agaagaaaat cctgatgcgg aagacggagg 60 
agcctggcaa atacagcgcc tgtgagcccc t 91 



<210> 20 

<211> 126 

<212> DNA 

<213> Homo 



sapiens 



<400> 20 

ggtgtgacag gcttgtccgc agtcgagatg 
ctagccattg acacattgag gaagctgagg 
ggggag 



gaccagatca cgcctgccct ctgggaggcc 60 
attgggacaa ggaggccaag gattaggtac 12 0 
126 
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<210> 21 

<211> 131 

<212> DNA 

<213> Homo sapiens 



<400> 21 

tcatctacag atgggggcag gaagctcata 
tacgtctttt actgcaaaga ccagcgccgt 
ggtgaggggc c 



tacctgcagg agctgcccgg gacggacgac 60 
gggggcctgc gctacatggg aaagcttgtg 12 0 
131 



<210> 22 
<211> 188 
<212> DNA 
<213> Homo , 



<400> 22 

ctccttttag catctgctcc ctgcagggcc 
ccacctcacc tgcaggtagg aatcctaata 
aattggtgca gcacaaggga ctctcggagg 
aggatcgg 



gtgccgctgt ccccacgtcg gctcacctgg 60 
ccaacctgga ggccctggaa gaatttaaga 12 0 
aggacatttt cacgcccctg cagacgggtg 18 0 
188 



<210> 23 

<211> 44 

<212> DNA 

<213> Homo sapiens 

<400> 23 

gtctctgcag gaagctgcgt tctcgaacac tagggtgagt gage 



<210> 24 
<211> 130 
<212> DNA 

<213> Homo sapiens 



<400> 24 

cccagcacag cagcccccgg gtctgcacct 
cccggaccac ctggacctac cctccagcca 
ccaaataaag 



ccagagccca ccctaccacc agacacagag 6 0 
tgacccttcc ctgctcccac ccacctgact 120 
130 



<210> 25 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 25 

Gly Thr Trp Tyr 



hi , , 



23 



<210> 26 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 26 

cctgcaggta catgagcttc c 21 



<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 



<210> 28 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 28 

gactcgagtc gacatcgatt tttttttttt ttttt 35 



<210> 29 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 



<400> 27 

cgctgtattt gccaggctcc 



20 



<400> 29 

cctctcccag ccccagcaag 



20 



<210> 30 
<211> 17 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
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<400> 30 
gactcgagtc 



<210> 31 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Prii 
<400> 31 

cgcccagtga cctgccgagg tc 



<210> 32 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 32 

ctttatttgg agtcaggtgg gtg 



<210> 33 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 33 

ctctgccccc tctgctgatg 



<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 34 

cctgcttcac caccttcttg 

<210> 35 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 



<400> 35 

gactcagact ccggagatga 



20 



<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 36 

aactcagaca ccagagatga 20 



<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 



<210> 38 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 38 

ccaggaggga ccactaca 18 



<210> 39 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 



<400> 37 

gactcagatc ccggagatga 



20 



<400> 39 

ccgggacgga cgactacg 



18 
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<210> 40 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 40 

ctcatgacca cagtccatgc 



<210> 41 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 41 

tgttcgggaa cgcagctt 



<210> 42 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 42 

tgccgctgtc cccacgtcgg 



<210> 43 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
motif 

<400> 43 

Ala Leu Trp Glu Ala Leu Ala lie Asp Thr Arg Leu Lys 



27 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 44 

gtcggatccc tgtccttcac cctggagg 28 



<210> 45 

<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 45 

gtcctcgagg tgttcgggaa cgcagcttc 



<210> 46 
<211> 176 
<212> PRT 

<213> Homo sapiens 
<400> 46 

Met Lys Pro Leu Leu Leu Ala Val Ser Leu Gly Leu lie Ala Ala Leu 
15 10 15 

Gin Ala His His Leu Leu Ala Ser Asp Glu Glu lie Gin Asp Val Ser 

20 25 30 

Gly Thr Trp Tyr Leu Lys Ala Met Thr Val Asp Arg Glu Phe Pro Glu 
35 40 45 

Met Asn Leu Glu Ser Val Thr Pro Met Thr Leu Thr Thr Leu Glu Gly 
50 55 60 

Gly Asn Leu Glu Ala Lys Val Thr Met Leu lie Ser Gly Arg Cys Gin 
65 70 75 80 

Glu Val Lys Ala Val Leu Glu Lys Thr Asp Glu Pro Gly Lys Tyr Thr 
85 90 95 

Ala Asp Gly Gly Lys His Val Ala Tyr lie lie Arg Ser His Val Lys 
100 105 110 

Asp His Tyr lie Phe Tyr Cys Glu Gly Glu Leu His Gly Lys Pro Val 
115 120 125 

Arg Gly Val Lys Leu Val Gly Arg Asp Pro Lys Asn Asn Leu Glu Ala 
130 135 140 

Leu Glu Asp Phe Glu Lys Ala Ala Gly Ala Arg Gly Leu Ser Thr Glu 
145 150 155 160 
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Ser lie Leu lie Pro Arg Gin Ser Glu Thr Cys Ser Pro Gly Ser Asp 
165 170 175 



<210> 47 

<211> 176 

<212> PRT 

<213> Rattus sp. 

<400> 47 

Met Lys Ser Arg Leu Leu Thr Val Leu Leu Leu Gly Leu Met Ala Val 
15 10 15 

Leu Lys Ala Gin Glu Ala Pro Pro Asp Asp Gin Glu Asp Phe Ser Gly 
20 25 30 

Lys Trp Tyr Thr Lys Ala Thr Val Cys Asp Arg Asn His Thr Asp Gly 
35 40 45 

Lys Arg Pro Met Lys Val Phe Pro Met Thr Val Thr Ala Leu Glu Gly 
50 55 60 

Gly Asp Leu Glu Val Arg lie Thr Phe Arg Gly Lys Gly His Cys His 



Leu Arg Arg lie Thr Met His Lys Thr Asp Glu Pro Gly Lys Tyr Thr 

85 90 95 

Thr Phe Lys Gly Lys Lys Thr Phe Tyr Thr Lys Glu lie Pro Val Lys 

100 105 110 

Asp His Tyr lie Phe Tyr lie Lys Gly Gin Arg His Gly Lys Ser Tyr 

115 120 125 

Leu Lys Gly Lys Leu Val Gly Arg Asp Ser Lys Asp Asn Pro Glu Ala 

130 135 140 

Met Glu Glu Phe Lys Lys Phe Val Lys Ser Lys Gly Phe Arg Glu Glu 

145 150 155 160 

Asn lie Thr Val Pro Glu Leu Leu Asp Glu Cys Val Pro Gly Ser Asp 

165 170 175 



<210> 48 

<211> 178 

<212> PRT 

<213> Bovine sp . 

<400> 48 

Met Lys Cys Leu Leu Leu Ala Leu Ala Leu Thr Cys Gly Ala Gin Ala 



Leu lie Val Thr Gin Thr Met Lys Gly Leu Asp lie Gin Lys Val Ala 
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Gly Thr Trp Tyr Ser Leu Ala Met Ala Ala Ser Asp lie Ser Leu Leu 

35 40 45 

Asp Ala Gin Ser Ala Pro Leu Arg Val Tyr Val Glu Glu Leu Lys Pro 

50 55 60 ' 

Thr Pro Glu Gly Asp Leu Glu lie Leu Leu Gin Lys Trp Glu Asn Gly 

65 70 75 80 

Glu Cys Ala Gin Lys Lys lie lie Ala Glu Lys Thr Lys lie Pro Ala 



Val Phe Lys lie Asp Ala Leu Asn Glu Asn Lys Val Leu Val Leu Asp 
100 105 110 

Thr Asp Tyr Lys Lys Tyr Leu Leu Phe Cys Met Glu Asn Ser Ala Glu 
115 120 125 

Pro Glu Gin Ser Leu Ala Cys Gin Cys Leu Val Arg Thr Pro Glu Val 
130 135 140 

Asp Asp Glu Ala Leu Glu Lys Phe Asp Lys Ala Leu Lys Ala Leu Pro 
145 150 155 160 

Met His lie Arg Leu Ser Phe Asn Pro Thr Gin Leu Glu Glu Gin Cys 
165 170 175 



<210> 49 
<211> 180 
<212> PRT 
<213> Murine sp . 

<400> 49 

Met Lys Met Leu Leu Leu Leu Cys Leu Gly Leu Thr Leu Val Cys Val 



His Ala Glu Glu Ala Ser Ser Thr Gly Arg Asn Phe Asn Val Glu Lys 

20 25 30 

lie Asn Gly Glu Trp His Thr lie lie Leu Ala Ser Asp Lys Arg Glu 

35 40 45 

Lys lie Glu Asp Asn Gly Asn Phe Arg Leu Phe Leu Glu Gin lie His 

50 55 60 

Val Leu Glu Asn Ser Leu Val Leu Lys Phe His Thr Val Arg Asp Glu 



Glu Cys Ser Glu Leu Ser Met Val Ala Asp Lys Thr Glu Lys Ala Gly 
85 90 95 

Glu Tyr Ser Val Thr Tyr Asp Gly Phe Asn Thr Phe Thr lie Pro Lys 
100 105 110 
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Thr Asp Tyr Asp Asn Phe Leu Met 
115 120 

Gly Glu Thr Phe Gin Leu Met Gly 
130 135 

Met Ser Asp lie Lys Glu Arg Phe 
145 150 

lie Leu Arg Glu Asn lie He Asp 
165 

Gin Ala Arg Glu 
180 



Ala His Leu He Asn Glu Lys Asp 
125 

Leu Tyr Gly Arg Glu Pro Asp Leu 
140 

Ala Gin Leu Cys Glu Glu His Gly 
155 160 

Leu Ser Asn Ala Asn Arg Cys Leu 
170 175 



<210> 50 

<211> 199 

<212> PRT 

<213> Homo sapiens 

<400> 50 

Met Lys Trp Val Trp Ala Leu Leu Leu Leu Ala Ala Trp Ala Ala Ala 
15 10 15 

Glu Arg Asp Cys Arg Val Ser Ser Phe Arg Val Lys Glu Asn Phe Asp 
20 25 30 

Lys Ala Arg Phe Ser Gly Thr Trp Tyr Ala Met Ala Lys Lys Asp Pro 
35 40 45 

Glu Gly Leu Phe Leu Gin Asp Asn lie Val Ala Glu Phe Ser Val Asp 
50 55 60 

Glu Thr Gly Gin Met Ser Ala Thr Ala Lys Gly Arg Val Arg Leu Leu 
65 70 75 80 

Asn Asn Trp Asp Val Cys Ala Asp Met Val Gly Thr Phe Thr Asp Thr 
85 90 95 

Glu Asp Pro Ala Lys Phe Lys Met Lys Tyr Trp Gly Val Ala Ser Phe 
100 105 110 

Leu Gin Lys Gly Asn Asp Asp His Trp He Val Asp Thr Asp Tyr Asp 
115 120 125 

Thr Tyr Ala Val Gin Tyr Ser Cys Arg Leu Leu Asn Leu Asp Gly Thr 
130 135 140 

Cys Ala Asp Ser Tyr Ser Phe Val Phe Ser Arg Asp Pro Asn Gly Leu 
145 150 155 160 

Pro Pro Glu Ala Gin Lys lie Val Arg Gin Arg Gin Glu Glu Leu Cys 
165 170 175 

Leu Ala Arg Gin Tyr Arg Leu He Val His Asn Gly Tyr Cys Asp Gly 
180 185 190 



Arg Ser Glu Arg Asn Leu Leu 

195 



<210> 51 

<211> 159 

<212> PRT 

<213> Bovine sp . 

<400> 51 

Ala Gin Glu Glu Glu Ala Glu Gin Asn Leu Ser Glu Leu Ser Gly Pro 



Trp Arg Thr Val Tyr lie Gly Ser Thr Asn Pro Glu Lys lie Gin Glu 

20 25 30 

Asn Gly Pro Phe Arg Thr Tyr Phe Arg Glu Leu Val Phe Asp Asp Glu 

35 40 45 

Lys Gly Thr Val Asp Phe Tyr Phe Ser Val Lys Arg Asp Gly Lys Trp 

50 55 60 

Lys Asn Val His Val Lys Ala Thr Lys Gin Asp Asp Gly Thr Tyr Val 

65 70 75 80 

Ala Asp Tyr Glu Gly Gin Asn Val Phe Lys lie Val Ser Leu Ser Arg 



Thr His Leu Val Ala His Asn He Asn Val Asp Lys His Gly Gin Thr 
100 105 110 

Thr Glu Leu Thr Glu Leu Phe Val Lys Leu Asn Val Glu Asp Glu Asp 
115 120 125 

Leu Glu Lys Phe Trp Lys Leu Thr Glu Asp Lys Gly He Asp Lys Lys 
130 135 140 

Asn Val Val Asn Phe Leu Glu Asn Glu Asp His Pro His Pro Glu 
145 150 155 



<210> 52 

<211> 181 

<212> PRT 

<213> Rattus sp. 

<400> 52 

Met Lys Leu Leu Leu Leu Leu Leu Cys Leu Gly Leu Thr Leu Val Cys 
15 10 15 

Gly His Ala Glu Glu Ala Ser Ser Thr Arg Gly Asn Leu Asp Val Ala 
20 25 30 

Lys Leu Asn Gly Asp Trp Phe Ser He Val Val Ala Ser Asn Lys Arg 
35 40 45 



Glu Lys He Glu Glu Asn Gly Ser Met Arg Val Phe Met Gin His He 
50 55 60 
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Asp Val Leu Glu Asn 
65 

Gly Glu Cys Arg Glu 
85 

Gly Glu Tyr Phe Val 
100 

Lys Thr Asp Tyr Asp 

115 

Asn Gly Glu Thr Phe 
130 

Leu Ser Ser Asp lie 
145 

Gly lie Thr Arg Asp 
165 

Leu Gin Ala Arg Gly 
180 



Ser Leu Gly Phe Lys Phe 
70 75 

Leu Tyr Leu Val Ala Tyr 
90 

Glu Tyr Asp Gly Gly Asn 
105 

Arg Tyr Val Met Phe His 
120 

Gin Leu Met Val Leu Tyr 
135 

Lys Glu Lys Phe Ala Lys 
150 155 

Asn lie lie Asp Leu Thr 
170 



Arg lie Lys Glu Asn 
80 

Lys Thr Pro Glu Asp 
95 

Thr Phe Thr lie Leu 
110 

Leu lie Asn Phe Lys 

125 

Gly Arg Thr Lys Asp 
140 

Leu Cys Glu Ala His 
160 

Lys Thr Asp Arg Cys 
175 



<210> 53 

<211> 157 

<212> PRT 

<213> Sus scrofa 

<400> 53 

Gin Glu Pro Gin Pro Glu Gin Asp Pro Phe Glu Leu Ser Gly Lys Trp 
15 10 15 

lie Thr Ser Tyr lie Gly Ser Ser Asp Leu Glu Lys lie Gly Glu Asn 
20 25 30 

Ala Pro Phe Gin Val Phe Met Arg Ser He Glu Phe Asp Asp Lys Glu 
35 40 45 

Ser Lys Val Tyr Leu Asn Phe Phe Ser Lys Glu Asn Gly He Cys Glu 
50 55 60 

Glu Phe Ser Leu He Gly Thr Lys Gin Glu Gly Asn Thr Tyr Asp Val 



Asn Tyr Ala Gly Asn Asn Lys Phe Val Val Ser Tyr Ala Ser Glu Thr 
85 90 95 

Ala Leu lie He Ser Asn He Asn Val Asp Glu Glu Gly Asp Lys Thr 
100 105 110 

He Met Thr Gly Leu Leu Gly Lys Gly Thr Asp He Glu Asp Gin Asp 
115 120 125 



[ 
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Leu Glu Lys Phe Lys Glu Val Thr Arg Glu Asn Gly lie Pro Glu Glu 
130 135 140 

Asn lie Val Asn lie lie Glu Arg Asp Asp Cys Pro Ala 
145 150 155 
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JTOMAELJ3DORANT- BINDING PROTEINS WHICH BIND HYDROPHOBIC 

^ LIGANDS : POLYPEPT IDES AN D POLYNUCLEOTIDES ENCODING 

_ SAID__P.OLYPEPTIDES . AND APP LJ,CAT-IDN.S_J!HEREO^L - 

The present invention relates to the demonstration of 
novel human odorant-binding proteins, hereafter named 
"OBPs", and also to the applications thereof, both 
therapeutically and non- therapeutically . 

The present invention is based on the identification of 
a family of lipocalin genes, composed of three genes 
and two pseudogenes, on human chromosome 9q34; the 
three genes correspond to the LCN1 gene which has 
already been described and to two novel genes which are 
the subject of the present invention and are called 
hOBPIIa and hOBPIIb. In addition, the invention is 
based on the attribution of novel functions to already 
known human lipocalins, through the demonstration of 
new areas of expression. 

Although a certain number of OBPs have already been 
demonstrated (Pelosi et al . , 1996), such as the rat OBP 
for example (see in particular patent EP-0 335 654) , 
the OBPs according to the present invention are human 
OBPs which have a very large number of advantages, as 
will emerge from the remainder of the text, compared to 
the murine proteins . 

These OBP proteins of the lipocalin family have, as far 
as some of them are concerned, been mentioned indirectly 
in patent WO 99/07740, but their OBP function has never 
been described to date; the same is also true of the 
proteins LCNl , retinol-binding protein (RBP) and apo- 
lipoprotein D (ApoD) , as will be more fully explained 
hereinafter . 

Historically, the lipocalin family (Pervaiz and Brew, 
1987), has been defined based on the human retinol- 
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binding protein (RBP) and on 3 other proteins: bovine 
(3-lactoglobulin, rat a2*i-globulin and human al-micro- 
globulin. From these reference proteins, and using 
sequence homologies, the lipocalin family has increased 
in size so as to now incorporate a large number of 
proteins, more than about 100, both in eukaryotes and 
in prokaryotes (Flower et al . , 1995, 1996). This family 
consists of small proteins (160-190 amino acids) which 
contain a hydrophobic pocket and are generally secreted 
(Bocskei et al . , 1992; Senoo et al . , 1990, Zeng et al . , 
1996; Miller, 1998), although, in certain cases, they 
are proteins which remain associated with the membrane 
(Nagata et al . , 1991). In vertebrates, the sequence 
identities between the various lipocalins are in the 
region of 20%; however, the sequence identities are 
greater for the orthologous proteins, and also for the 
recent paralogous genes described (Igarashi et al . , 
1992; Dewald et al . , 1992). 

The present invention relates to the isolated OBPII 
polypeptides encoded by the two new isolated human 
genes, OBPIIa and OBPIlb, located at the 9q34 locus; 
the invention also relates to the corresponding poly- 
nucleotide sequences, the corresponding mPJSIAs and also 
the regulatory promoter sequences which determine the 
expression profile in the various tissues and in 
particular in the secretory tissues. These two genes 
encode at least seven different polypeptides given the 
existence of alternative splicing of the transcripts. 
The OBPIIa gene encodes at least 4 different poly- 
peptides, named OBPII aOC , 0BPII a p, 0BPII aY and 0BPII a6 , and 
the OBPIlb gene encodes three different polypeptides, 
named OBPIIba, 0BPII b p and OBPII^. Even though one gene 
is mainly expressed, all the messenger forms are found 
in the nasal structure. 

The subject of the present invention is therefore an 
isolated polypeptide comprising an amino acid sequence 
having at least 9 0% identity with the amino acid 



sequences SEQ ID No . 2, SEQ ID No . 4, SEQ ID No . 6, SEQ 
ID No. 8, SEQ ID No. 10, SEQ ID No. 12 or SEQ ID No. 14, 
and LCN1, ApoD and RBP . 

It should be understood that the invention relates to 
the polypeptides obtained by purification from natural 
sources or obtained using recombinant techniques, as 
will be described subsequently; the invention also 
relates to the polypeptides obtained by chemical 
synthesis, which may then comprise unnatural amino 
acids. In the present description, the term "poly- 
peptide" will be used to denote equally a protein or a 
peptide . 

The expression "percentage identity between the 
sequences" is intended to denote the percentage of 
amino acids which are identical between the sequences, 
obtained with the best possible sequence alignment, 
this percentage being purely statistical and the 
differences between the polypeptides being distributed 
randomly and over the entire length. 

The present invention also relates to an isolated poly- 
peptide, characterized in that it comprises a poly- 
peptide chosen from: 

a) a polypeptide of sequence SEQ ID No . 2, SEQ ID No . 4, 
SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 10, SEQ ID 
No. 12 or SEQ ID No. 14; 

b) a polypeptide which is a variant of a polypeptide of 
amino acid sequences defined in a) ; 

c) a polypeptide homologous to the polypeptide defined 
in a) or b) and comprising at least 90% identity 
with said polypeptide of a) ; 

d) a fragment of at least 15 consecutive amino acids of 
a polypeptide defined in a) , b) or c) ; 
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e) a biologically active fragment of a polypeptide 
defined in a) , b) or c) . 

5 The term "variant polypeptide" will be intended to mean 
all the mutated polypeptides which may exist naturally, 
in particular in humans, and which correspond in 
particular to truncations, substitutions, deletions 
and/or additions of amino acid residues. The variant 
10 polypeptides according to the invention conserve at 
least one hydrophobic ligand-binding domain. 

The term "homologous polypeptide" will be intended to 
denote the polypeptides exhibiting, compared to the 
15 polypeptides of SEQ ID No. 2, SEQ ID No. 4, SEQ ID 
No. 6, SEQ ID No. 8, SEQ ID No. 10, SEQ ID No. 12 and 
SEQ ID No. 14, certain modifications, such as in 
particular a deletion, addition or substitution of at 
least one amino acid, a truncation, an extension and/or 

2 0 a chimeric fusion. Among the homologous polypeptides, 

preference is given to those in which the amino acid 
sequence exhibits at least 90% identity, in preference 
95%, preferably 97%, and even more preferably 99% 
identity with the amino acid sequences of the poly- 
25 peptides according to the invention. In the case of a 
substitution, one or more amino acids, which may or may 
not be consecutive, are replaced with "equivalent amino 
acids". The expression "equivalent amino acid" is 
herein aimed at denoting any amino acid which can be 

3 0 substituted for one of the amino acids of the basic 

structure, without however modifying the essential 
functional properties or characteristics of the 
corresponding polypeptides, their biological activities, 
such as for example the in vivo induction of antibodies 
3 5 capable of recognizing the polypeptide, the amino acid 
sequence of which is included in the amino acid 
sequence SEQ ID No . 2, SEQ ID No . 4, SEQ ID No . 6, SEQ 
ID No. 8, SEQ ID No. 10, SEQ ID No. 12 or SEQ ID 
No. 14, or a fragment thereof. These equivalent amino 



acids may be determined either based on their structural 
homology with the amino acids for which they substitute, 
or based on the results of cross biological assays 
which the various polypeptides may produce. By way of 
5 example, mention will be made of the possibilities of 
substitutions which can be made without their being, as 
a result of this, an extensive modification of the 
biological activities of the correspondingly modified 
polypeptides; for example, replacing leucine with 

10 valine or isoleucine, aspartic acid with glutamic acid, 
glutamine with asparagine, arginine with lysine, etc., 
it naturally being possible to envision the reverse 
substitutions under the same conditions. Thus, it is 
possible to envision introducing certain modifications, 

15 such as in particular a deletion, addition or substi- 
tution of at least one amino acid, into the alpha- 
helices of the protein without destroying the calyx 
formed by the structure composed of the beta-sheets; 
similarly, it is possible to introduce equivalent amino 

2 0 acids which make it possible to conserve the hydrophobic 

nature of the beta-sheets. It may also be advantageous 
to introduce modifications into the sequence of the 
polypeptides of the invention in order to generate 
homologous polypeptides lacking protease sites. 

25 

The term "biologically active fragment" will be intended 
to denote, in particular, a fragment of amino acid 
sequence of a polypeptide according to the invention, 
exhibiting at least one of the functional properties or 
30 characteristics of the polypeptides according to the 
invention, in particular in that: (i) it is capable of 
being recognized by an antibody specific for a poly- 
peptide according to the invention or by antibodies 
produced by patients during an immune reaction; (ii) it 

3 5 has at least one of the domains or regions as defined 

below; (iii) it is capable of binding a hydrophobic 
ligand and in particular odorous molecules, preferably 
pheromones; (iv) it is capable of specifically binding 
a receptor. 
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The term "polypeptide fragment" is intended to denote a 
polypeptide comprising a minimum of 15 amino acids, in 
preference 18 amino acids, preferably 25 amino acids 
5 and even more preferably 50 amino acids. The poly- 
peptide fragments according to the invention, obtained 
by cleavage of said polypeptide with a proteolytic 
enzyme, or with a chemical reagent, or by placing said 
polypeptide in a very acid environment, are also part 

10 of the invention. When the intention is to use a 
sequence of 15, 18, 2 5 or 50 amino acids, this, of 
course, preferably involves the portions corresponding 
to functional epitopes of the above polypeptides, which 
may be integrated into a polypeptide with a longer 

15 structure, for example in the form of a fusion protein; 
this will depend on the intended uses of these poly- 
peptides . 

According to a preferred embodiment, the present 
20 invention relates to an isolated polypeptide selected 
from a polypeptide corresponding to the sequence SEQ ID 
No. 2 and named OBPII aa , to the sequence SEQ ID No. 4 
and named OBPII aP , to the sequence SEQ ID No. 6 and 
named 0BPII aY , to the sequence SEQ ID No. 10 and named 

2 5 0BPII b a, and to the sequence SEQ ID No. 12 and named 

0BPII b p. 

The polypeptides according to the invention are 
characterized in that they preferably comprise the 

3 0 Gly-Thr-Trp-Tyr domain. 

The invention also relates to the isolated polynucleo- 
tide characterized in that it encodes a polypeptide as 
described above. The polynucleotide according to the 
35 invention is chosen from the polynucleotide of sequence 
SEQ ID No. 1, SEQ ID No . 3, SEQ ID No . 5, SEQ ID No . 7, 
SEQ ID No. 9, SEQ ID No. 11 or SEQ ID No. 13. 



The invention also relates to an isolated poly- 
nucleotide, characterized in that it comprises a 
polynucleotide chosen from: 

a) a polynucleotide of sequence SEQ ID No. 1, SEQ ID 
No. 3, SEQ ID No. 5, SEQ ID No . 7, SEQ ID No . 9, SEQ 
ID No. 11 or SEQ ID No. 13, or the sequence of which 
is that of the RNA corresponding to the sequence SEQ 
ID No. 1, SEQ ID No. 3, SEQ ID No . 5, SEQ ID No . 7, 
SEQ ID No. 9, SEQ ID No. 11 or SEQ ID No. 13; 

b) a polynucleotide, the sequence of which is comple- 
mentary to the sequence of a polynucleotide defined 
in a) , 

c) a polynucleotide, the sequence of which comprises at 
least 80% identity, in preference 90%, preferably 
95%, and even more preferably 97% identity with a 
polynucleotide defined in a) or b) , 

d) a polynucleotide which hybridizes, under conditions 
of high stringency, with a sequence of a poly- 
nucleotide defined in a) , b) or c) , 

e) a fragment of at least 15 consecutive nucleotides, 
in preference 21 consecutive nucleotides, and 
preferably 3 0 consecutive nucleotides, of a poly- 
nucleotide defined in a) , b) , c) or d) . 

In the present description, the terms "polynucleotide", 
"oligonucleotide", "polynucleotide sequence", "nucleo- 
tide sequence" or "nucleic acid" will be intended to 
denote a DNA fragment, both a double- stranded DNA, a 
single-stranded DNA and products for transcription of 
said DNAs, and/or an RNA fragment, said synthetic or 
isolated natural fragments, possibly comprising 
unnatural nucleotides, denoting a precise chain of 
nucleotides, which may or may not be modified, making 
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it possible to define a fragment or- a region of a 
nucleic acid. 



The expression "polynucleotide of complementary 
5 sequence" is intended to denote any DNA, the nucleotides 
of which are complementary to those of SEQ ID No. 1, 
SEQ ID No. 3, SEQ ID No . 5, SEQ ID No . 7, SEQ ID No . 9, 
SEQ ID No. 11 or SEQ ID No. 13, or a portion of SEQ ID 
No. 1, SEQ ID No. 3, SEQ ID No . 5, SEQ ID No . 7, SEQ ID 
10 No. 9, SEQ ID No. 11 or SEQ ID No. 13, and the orienta- 
tion of which is reversed. 

For the purpose of the present invention, the term 
"percentage identity" is intended to mean a percentage 
15 of bases which are identical between the polynucleo- 
tides, obtained after the best alignment, this 
percentage being purely statistical and the differences 
between the two polynucleotides being distributed 
randomly and over their entire length. 

20 

For the purpose of the present invention, the expression 
"hybridization under conditions of high stringency" 
means that the conditions of temperature and ionic 
strength are chosen such that they allow the hybridiza- 
25 tion between two complementary DNA fragments to be 
maintained. By way of illustration, conditions of high 
stringency for the hybridization step for the purposes 
of defining the polynucleotide fragments described 
above are advantageously as follows: 

30 

The DNA -DNA or DNA-RNA hybridization is carried out in 
two steps: (1) prehybridization at 42 °C for 3 hours in 
phosphate buffer (20 mM, pH 7.5) containing 5 x SSC 
(1 x SSC corresponds to a solution of 0.15 M NaCl + 
35 0.015 M sodium citrate), 50% of formamide, 7% of sodium 
dodecyl sulfate (SDS), 10 x Denhardt's, 5% of dextran 
sulfate and 100 Mg/ml of salmon sperm DNA; (2) actual 
hybridization for 2 0 hours at a temperature which 
depends on the length of the probe (i.e.: 42 °C for a 
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probe of length > 100 nucleotides) , followed by 2 washes 
for 20 minutes at 20°C in 2 x SSC + 2% SDS, 1 wash for 
20 minutes at 2 0°C in 0.1 X SSC + 0.1% SDS. The final 
wash is carried out in 0.1 X SSC + 0.1% SDS for 
5 3 0 minutes at 60°C for a probe of length > 100 nucleo- 
tides. The high stringency hybridization conditions 
described above for a nucleotide of defined length will 
be adjusted by those skilled in the art for longer or 
shorter oligonucleotides, according to the teaching of 
10 Sambrook et al . (1989). 

Advantageously, a nucleotide fragment corresponding to 
the above definition will have at least 15 consecutive 
nucleotides, preferably at _!east 21 nucleotides, and 
15 even more preferentially at least 3 0 consecutive 
nucleotides, of the sequence from which it is derived. 

According to one embodiment of the invention, the poly- 
nucleotide according to the invention is characterized 
20 in that it is directly or indirectly labeled with a 
radioactive compound or a nonradioactive compound. The 
polynucleotide according to the invention is used as a 
primer for amplifying or polymerizing nucleic acid 
sequences; the invention also relates to the use of a 

2 5 polynucleotide according to the invention as a probe 

for detecting nucleic acid sequences. According to the 
invention, the polynucleotide fragments which may be 
used as a probe or as a primer in processes for 
detecting, identifying, titering or amplifying nucleic 

3 0 acid sequence will have a minimum length of 9 bases, 

preferably 18 bases, and even more preferably 3 6 bases. 
Finally, the invention relates to the use of a poly- 
nucleotide according to the invention as a sense or 
antisense oligonucleotide for controlling the expression 
35 of the corresponding protein product, in this case a 
polypeptide according to the invention. 

The unlabeled polynucleotide sequences according to the 
invention may be used directly as a probe, primer or 
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oligonucleotide; however, the sequences used are 
generally labeled so as to obtain sequences which can 
be used for many applications. The primers, probes and 
oligonucleotides according to the invention are labeled 
with radioactive elements or with nonradioactive 
molecules; among the radioactive isotopes used, mention 
may be made of 32 P, 33 P, 35 S, 3 H or 125 X . The nonradio- 
active entities are selected from ligands, such as 
biotin, avidin, streptavidin or digoxigenin, haptens, 
dyes and luminescent agents, such as radioluminescent , 
chemi luminescent , bioluminescent , fluorescent or 
phosphorescent agents . 

The invention also comprises a method for detecting 
and/or titering nucleic acid according to the invention, 
in a biological sample, characterized in that it 
comprises the following steps: (i) isolating the DNA 
from the biological sample to be analyzed, or obtaining 
a cDNA from the RNA of the biological sample; (ii) 
specifically amplifying the DNA encoding the poly- 
peptide according to the invention using primers; (iii) 
analyzing the amplification products. 

The invention also comprises a kit for detecting and/or 
titering a nucleic acid according to the invention, in 
a biological sample, characterized in that it comprises 
the following elements: (i) a pair of nucleic acid 
primers according to the invention, (ii) the reagents 
required to perform a DNA amplification reaction and, 
optionally, (iii) a component for verifying the 
sequence of the amplified fragment, more particularly a 
probe according to the invention. 

The invention also comprises a method for detecting 
and/or titering nucleic acid according to the invention, 
in a biological sample, characterized in that it 
comprises the following steps: (i) bringing a probe 
according to the invention into contact with a 
biological sample; (ii) detecting and/or titering the 
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hybrid formed between said probe and the DNA of the 
biological sample. 



The invention also comprises a kit for detecting and/or 
5 titering nucleic acid according to the invention, in a 
biological sample, characterized in that it comprises 
the following elements: (i) a probe according to the 
invention, (ii) the reagents required to carry out a 
hybridization reaction and, where appropriate, (iii) a 
10 pair of primers according to the invention, and also 
the reagents required for a DNA amplification reaction. 

The invention in particular relates to the methods 
according to the invention described above, for 
15 detecting and diagnosing cells of cancerous origin, and 
mainly breast, uterine, ovarian, prostate and lung 
cancers . 

The polynucleotides according to the invention may thus 

2 0 be used as a primer and/or probe in processes using in 

particular the PCR (polymerase chain reaction) technique 
(Erlich, 1989; Innis et al . , 1990, and Rolfs et al . , 
1991) . This technique requires choosing pairs of oligo- 
nucleotide primers which border the fragment which must 
25 be amplified. Reference may, for example, be made to 
the technique described in American patent U.S. No. 
4 683 202. The amplified fragments can be identified, 
for example after agarose or polyacryl amide gel 
electrophoresis or after a chromatography technique 

3 0 such as gel filtration or ion-exchange chromatography. 

The specificity of the amplification can be controlled 
by molecular hybridization using, as a probe, the 
nucleotide sequences polynucleotides of the invention, 
plasmids containing these sequences or products of 
35 amplification thereof. The amplified nucleotide 
fragments may be used as reagents in hybridization 
reactions in order to demonstrate the presence, in a 
biological sample, of a target nucleic acid of sequence 
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complementary to that of said amplified nucleotide 
fragments . 



The invention is also directed toward the nucleotide 
5 fragments which can be obtained by amplification using 
primers according to the invention. 

Other techniques for amplifying target nucleic acid may 
advantageously be used as an alternative to PCR (PCR- 
10 like) , using pairs of primers of nucleotide sequences 
according to the invention. The term "PCR-like" will be 
intended to denote all methods using direct or indirect 
reproductions of nucleic acid sequences, or in which 
the labeling systems have been amplified; these 
15 techniques are of course known; in general, this 
involves amplifying the DNA with a polymerase; when the 
sample of origin is an RNA, a reverse transcription 
should be performed beforehand. There are currently a 
large number of processes for this amplification, such 
2 0 as for example the SDA (strand displacement amplifica- 
tion) technique (Walker et al . , 1992), the TAS 
(transcription-based amplification system) technique 
described by Kwoh et al . in 1989, the 3SR (self- 
sustained sequence replication) technique described by 
25 Guatelli et al . in 1990, the NASBA (nucleic acid 
sequence based amplification) technique described by 
Kievitis et al . in 1991, the TMA (transcription 
mediated amplification) technique, the LCR (ligase 
chain reaction) technique described by Landegren et al . 
30 in 1988 and improved by Barany et al . in 1991, which 
uses a heat-stable ligase, the RCR (repair chain 
reaction) technique described by Segev in 1992, the CPR 
(cycling probe reaction) technique described by Duck et 
al. in 1990, and the Q-beta-replicase amplification 
35 technique described by Miele et al . in 1983 and 
improved in particular by Chu et al . in 1986 and 
Lizardi et al . in 1988, and then by Burg et al . (1996) 
and also by Stone et al . (1996) . 
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When the target polynucleotide to be detected is an 
RNA, for example an mRNA, use will advantageously be 
made, prior to carrying out an amplification reaction 
using the primers according to the invention or to 
5 carrying out a detection process using the probes of 
the invention, of an enzyme of the reverse transcriptase 
type in order to obtain a cDNA from the RNA contained 
in the biological sample. The cDNA obtained will then 
be used as a target for the primers or the probes used 
10 in the amplification or detection process according to 
the invention. 

The nucleotide probes according to the invention 
hybridize specifically with a DNA or RNA molecule of a 

15 polynucleotide according to the invention, more 
particularly with the sequences SEQ ID No. 1, SEQ ID 
No. 3, SEQ ID No. 5, SEQ ID No . 7, SEQ ID No . 9, SEQ ID 
No. 11 and SEQ ID No. 13, under high stringency 
hybridization conditions as given in the form of a 

20 previous example. 

The hybridization technique may be carried out in 
diverse ways (Matthews et al . , 1988). The most general 
method consists in immobilizing the nucleic acid 

2 5 extracted from the cells of various tissues or from 

cells in culture, on a support (such as nitrocellulose, 
nylon or polystyrene) , and in incubating the 
immobilized target nucleic acid with the probe, under 
well-defined conditions. After hybridization, the 

3 0 excess probe is removed and the hybrid molecules formed 

are detected by the suitable method (measurement of the 
radioactivity, of the fluorescence or of the enzymatic 
activity linked to the probe) . 

3 5 According to another embodiment of the nucleic acid 
probes according to the invention, they may be used as 
a capture probe. In this case, a probe, termed "capture 
probe", is immobilized on a support and is used to 
capture, by specific hybridization, the target nucleic 
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acid obtained from the biological sample to be tested, 
and the target nucleic acid is then detected using a 
second probe, termed "detection probe", labeled with an 
easily detectable element. 

5 

In a preferred embodiment, the invention comprises the 
use of a sense or antisense oligonucleotide for 
controlling the expression of the corresponding protein 
product. Among the advantageous nucleic acid fragments, 

10 mention should in particular be made of the antisense 
oligonucleotides the structure of which ensures, by- 
hybridization with the target sequence, inhibition of 
the expression of the corresponding product. Mention 
should also be made of the sense oligonucleotides 

15 which, by interaction with proteins involved in 
regulating the expression of the corresponding product, 
will induce either inhibition or activation of this 
expression. The oligonucleotides according to the 
invention have a minimum length of 9 bases, preferably 

2 0 of 18 bases, and even more preferably 3 6 bases. 

The invention relates to a recombinant vector for 
cloning a polynucleotide according to the invention 
and/or for expressing a polynucleotide according to the 
invention, characterized in that it contains a poly- 
nucleotide according to the invention as described 
above. The vector according to the invention is 
characterized in that it comprises the elements which 
allow the expression of said sequences in a host cell 
and, optionally, the secretion of said sequences out of 
the host cell. 

The term "expression vector" is intended to mean both 
autonomously replicating expression vectors of the 
35 plasmid type and systems intended to ensure integration 
in the cells, but these expression vectors may also be 
expression vectors of the viral type or even, when the 
intention is to perform, for example, gene therapy, 
naked DNA. Among the viral vectors, preference is given 



25 



30 
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to those derived from adenoviruses, from adenovirus- 
associated viruses (AAVs) , from retroviruses, from 
lentiviruses , and preferably HIV derivatives, from 
poxviruses and from the herpesvirus, for expression in 
5 a eukaryotic system. Among the nonviral vectors, 
preference is given to naked polynucleotides, such as 
naked DNA or naked RNA according to the technique 
developed by the company VICAL, yeast artificial 
chromosomes (YACs) for expression in yeast, mouse 
10 artificial chromosomes (MACs) for expression in murine 
cells and, preferably, human artificial chromosomes 
(HACs) for expression in human cells. 

According to a particular embodiment, the vector 
15 according to the invention comprises elements for 
controlling the expression of the polypeptides; these 
control elements are preferably chosen from (i) the 
promoter sequence of the hOBPIIa gene according to the 
invention, which corresponds to the sequence SEQ ID 
20 No. 15 and/ or from the promoter sequence of the hOBPIIb 
gene according to the invention, which corresponds to 
the sequence SEQ ID No. 16; (ii) a polynucleotide, the 
sequence of which is complementary to the sequence SEQ 
ID No. 15 and SEQ ID No. 16; (iii) a polynucleotide, 
25 the sequence of which comprises at least 80% identity 
with a polynucleotide defined in ( (i) or (ii); (iv) a 
polynucleotide which hybridizes, under conditions of 
high stringency, with the polynucleotide sequence 
defined in (i), (ii) or (iii). 

3 0 

The computer tools available to those skilled in the 
art allow them to easily identify the regulatory 
promoter boxes required and sufficient to control the 
gene expression, in particular the TATA, CCAAT and GC 
35 boxes, and also the stimulatory (enhancer) or 
inhibitory (silencer) regulatory sequences which 
control, in CIS, the expression of the genes according 
to the invention; among these regulatory sequences, 
mention should be made of IRE, MRE and CRE . 
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It is also within the scope of the invention to use the 
elements defined above and chosen from the sequence SEQ 
ID No. 15 and SEQ ID No. 16, for controlling the 
5 expression of heterologous polypeptides other than 
those of the invention, and in particular for directing 
the expression of heterologous polypeptides in the cell 
types in which the polypeptides according to the 
invention are normally expressed. 

10 

The invention also comprises the host cells, in 
particular the eukaryotic and prokaryotic cells, 
characterized in that they are transformed with the 
vectors according to the invention. The host cells are 

15 preferably transformed under conditions which allow the 
expression of a recombinant polypeptide according to 
the invention. The host cell may be chosen from 
bacterial cells, but also from yeast cells and from 
plant and animal cells; the host cell is preferably a 

20 mammalian cell (Edwards and Aruffo, 1993), but also an 
insect cell in which it is possible to use processes 
which use baculoviruses for example (Luckow, 1993) . 
These cells may be obtained by introducing into host 
cells a nucleotide sequence inserted into a vector as 

25 defined above, and then culturing said cells under 
conditions which allow the replication and/or expression 
of the transfected nucleotide sequence. 

According to a particular embodiment, the invention 
30 also relates to a transgenic animal or plant which 
comprises host cells according to the invention. 

The invention also relates to a method for preparing a 
polypeptide, characterized in that it uses a vector 
3 5 according to the invention. More particularly, the 
invention relates to a method for preparing a recombi- 
nant polypeptide, characterized in that transformed 
cells according to the invention are cultured under 
conditions which allow the expression of said 
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recombinant polypeptide, and in that said recombinant 
polypeptide is recovered. 

The polypeptide according to the invention can be 
obtained according to a process of the invention and 
according to the techniques for producing recombinant 
polypeptides known to those skilled in the art. The 
present invention therefore relates to the recombinant 
polypeptide which can be obtained using the method 
presented above. In this case, the nucleic acid 
sequence used is placed under the control of signals 
which allow its expression in a host cell. An effective 
system for producing a recombinant polypeptide requires 
a vector, for example of plasmid or viral origin, and a 
compatible host cell. The vector must comprise a 
promoter, translation initiation and termination 
signals, and also suitable regions for regulating 
transcription. It must be possible for it to be 
maintained stably in the cell and it may, optionally, 
contain particular signals which specify secretion of 
the translated polypeptide. These various control 
signals are chosen as a function of the host cell used. 
To this effect, the nucleic acid sequences according to 
the invention may be inserted into vectors which 
replicate autonomously in the host chosen, or vectors 
which integrate in the host chosen. Such vectors will 
be prepared according to the methods commonly used by 
those skilled in the art, and the clones resulting 
therefrom may be introduced into a suitable host using 
standard methods, such as for example transfection by 
calcium phosphate precipitation, lipofection, electro- 
poration or heat shock. 

The recombinant polypeptides obtained as indicated 
above may be both in the glycosylated form and 
nonglycosylated form, and may or may not have the 
natural tertiary structure. 
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The polypeptides obtained by chemical synthesis and 
possibly comprising unnatural amino acids, corresponding 
to said recombinant polypeptides, are also included in 
the invention. The peptides according to the invention 
may also be prepared using techniques which are conven- 
tional in the domain of peptide synthesis. This 
synthesis may be performed in homogeneous solution or 
in solid phase. 

The recombinant polypeptide purification processes used 
are known to those skilled in the art. The recombinant 
polypeptide may be purified from cell lysates and 
extracts, and/or from the culture medium supernatant, 
by methods used individually or in combination, such as 
fractionation, chromatography methods, immunoaf f inity 
techniques using specific monoclonal or polyclonal 
antibodies, etc. A preferred variant consists in 
producing a recombinant polypeptide fused to a 
"carrier" protein (chimeric protein) . The advantage of 
this system is that it allows stabilization and a 
decrease in the proteolysis of the recombinant product, 
an increase in solubility during the in vitro 
renaturation and/or simplification of the purification 
when the fusion partner has affinity for a specific 
ligand. 

The present invention relates to the use of a poly- 
peptide chosen from an hOBPIIa and hOBPIIb polypeptide 
according to the invention, or a fragment thereof, and 
from LCNl, retinol-binding protein (RBP) and apolipo- 
protein D (ApoD) , as a protein which binds to a 
hydrophobic ligand, preferably an odorous molecule. In 
fact, besides the OBPII polypeptides mentioned above, 
the OBP function of the lipocalins LCNl , RBP and ApoD 
has never been suggested. Specifically, while the 
presence of LCNl in the nasal mucus has been described 
(Redl et al . , 1992), it was, in fact, attributed 
another function, namely an antiprotease function, 
whereas the main function of the LCNl protein purely 



appears to be the transport of hydrophobic ligands; 
moreover, the tests carried out in the context of the 
present invention with a recombinant LCN1 protein have 
not made it possible to demonstrate antiprotease 
activity. The same is true for retinol-binding protein 
and apolipoprotein D, which have never been presented 
as having OBP function. 

The present invention also relates to the use of a 
polypeptide according to the invention or a fragment 
thereof, LCNl, RBP and apolipoprotein D as a competitive 
inhibitor, as an agonist or as an antagonist of the 
cellular receptors for lipocalins. The use of an 
inhibitor of the binding of lipocalins to their 
receptor may be employed in an anticancer strategy; 
specifically, a certain number of tumors are hormone- 
dependent: thus, breast tumors are sensitive to 
steroids; since the steroids are transported by 
lipocalins, it is within the scope of the invention to 
provide inhibitors of binding to the polypeptides 
according to the invention and LCNl, OBP or APO-D to 
avoid the binding of steroid hormones to the receptors 
of tumor cells. 

The invention also relates to a monoclonal or poly- 
clonal antibody and fragments thereof, characterized in 
that they specifically bind a polypeptide according to 
the invention. Chimeric antibodies, humanized anti- 
bodies and single-chain antibodies are also part of the 
invention. The antibody fragments according to the 
invention are preferably Fab or F(ab')2 fragments. 

The polypeptides according to the invention make it 
possible to prepare monoclonal or polyclonal anti- 
bodies. The monoclonal antibodies may advantageously be 
prepared from hybridomas according to the technique 
described by Kohler and Milstein in 1975. The poly- 
clonal antibodies may be prepared for example by 
immunizing an animal, in particular a mouse, with a 



polypeptide according to the invention combined with an 
adjuvant of the immune response, and then purifying the 
specific antibodies contained in the serum of the 
immunized animals on an affinity column to which the 
5 polypeptide having been used as the antigen has been 
attached beforehand. The polyclonal antibodies according 
to the invention may also be prepared by purification 
on an affinity column on which a polypeptide according 
to the invention has been immobilized beforehand. 

10 

The invention also relates to a monoclonal antibody 
specific for a polypeptide according to the invention 
and capable of inhibiting the interaction between said 
polypeptide and the cellular receptor to which said 

15 polypeptide specifically binds. According to another 
embodiment, the monoclonal antibody according to the 
invention is capable of inhibiting the interaction 
between said polypeptide and its hydrophobic ligands, 
in preference the odorous molecules, and preferably the 

20 pheromones, to which said polypeptide binds. 

The antibodies of the invention may also be labeled in 
the same way as described previously for the nucleic 
acid probes of the invention, and preferably with 

25 labeling of the enzymatic, fluorescent or radioactive 
type. Such labeled antibodies may be used to detect 
these polypeptides in a biological sample. The 
biological sample preferably consists of a fluid, for 
example human biopsies, serum or blood. They thus 

3 0 constitute a means for analyzing the expression of the 
polypeptide according to the invention, for example by 
immunofluorescence and/or gold-labeling, and/or with 
enzymatic immunoconjugates . 

3 5 More generally, the antibodies of the invention may 
advantageously be used in any situation in which the 
expression of a polypeptide according to the invention 
is to be observed, and more particularly in immuno- 
cytochemistry , in immunohistochemistry or in Western 
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blotting experiments, or in ELISA and RIA techniques. 
It is thus within the scope of the invention to provide 
a method for detecting and/or titering a polypeptide 
according to the invention, in a biological sample, 
5 characterized in that it comprises the steps of 
bringing the biological sample into contact with 
antibodies according to the invention, and then 
revealing the antigen-antibody complex formed. 

10 Also falling within the context of the invention is a 
kit for detecting and/or titering a polypeptide 
according to the invention, in a biological sample, 
characterized in that it comprises the following 
elements: (i) a monoclonal or polyclonal antibody as 

15 described above; (ii) where appropriate, the reagents 
for constituting the medium suitable for the immuno- 
logical reaction; (iii) the reagents for detecting the 
antigen-antibody complexes produced by the immuno- 
logical reaction. This kit is in particular useful for 

2 0 carrying out Western blotting experiments and for 

immunoprecipi tation experiments. 

The hOBPIIs according to the present invention may be 
used in many applications. 

25 

The first applications of the OBPs lie in the control 
of odorants; these applications essentially concern 
personal hygiene (perfumery, cosmetology, pharmacy) or 
general hygiene. 

30 

They may be used in a process for controlling the 
volatilization of an odorant; such a process is 
characterized in that it comprises a step of binding of 
said odorant with a polypeptide according to the 

3 5 invention or with the LCN1 , RBP or apolipoprotein D 

proteins . 

It is also possible to bind the odorant to a solid 
support using an OBPII according to the invention 



attached to said support, it being possible for this 
binding to be both covalent binding and noncovalent 
binding, for example by adsorption or by binding of the 
avidin-biotin type, if necessary. Under these 
conditions, a fragranced support is obtained which 
retains the odorants for a longer period of time, 
gradually releasing them and possibly being used both 
in the cosmetics domain and in the domain of maintenance 
products in general . The supports used may be supports 
of the block type or of the ball type, for example. Of 
course, the OBPIIs according to the present invention 
may be more particularly used in the perfumery and 
cosmetics industry, in which the OBPs will be used in 
the form of a liquid mixture intended to control the 
volatilization of the odorants after the composition 
has been spread on human skin. This makes it possible 
to prolong the "staying-power" of fragrances or to form 
part of the composition of body deodorants in 
particular. Of course, the term "fragrance" generally 
encompasses the mixtures known in perfumery, based on 
alcohol or in aqueous form, and containing in particular 
essential oils. The polypeptide according to the 
invention may also form part of the composition of 
retinol-based creams, as an agent for transporting and 
protecting said retinol, for cosmetological applica- 
tions, and in particular for preventing, smoothing out 
and treating wrinkles and fine lines of the skin, and 
combating cutaneous and/or subcutaneous loosening. 

Among the applications relating to general hygiene, the 
process according to the invention may be used in a 
device directed toward deodorizing premises, such as 
for example animal houses or cowsheds. Such a device 
may also be envisioned for deodorizing the flow of air 
entering an air conditioning machine; such a device 
would be very useful in geographical regions which are 
polluted. 
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The invention also relates to a process for screening a 
molecule, preferably odorants or flavors, which 
comprises passing the molecule over a substrate which 
comprises a polypeptide according to the invention or 
LCN1 , RBP and apolipoprotein D, bound to said 
substrate, said polypeptide binding said odorant or 
flavor, and recovering said odorant or flavor from the 
polypeptide if necessary. The OBPs according to the 
invention may thus make it possible to isolate odorants 
or fragrances , by binding them to a support as described 
above but which, in this case, may be, for example, a 
chromatography column, by passing over this column the 
product from which it is desired to capture the 
odorants. It is, of course, possible, still in this 
same type of application, to carry out an analysis of a 
complex fragrance using the differences in retention of 
the various products by the OBPs. 

The isolation process described above may also be used 
to separate particular "odorants", especially human 
pheromones . 

The hOBPIIs according to the present invention may be 
used for agrofoods or industrial applications. 

The OBPs according to the present invention, by virtue 
of their characteristics, may make it possible to 
solubilize certain lipophilic molecules by associating 
them with said OBPs. Thus, the polypeptides according 
to the invention may be used in combination with 
dietary fatty acids by way of a food supplement. The 
present invention relates to a process for solubilizing 
lipophilic molecules, characterized in that it 
comprises binding said lipophilic molecule to a 
polypeptide according to the invention, to LCN1 , to RBP 
or to apoD . 

The OBPIIs, like LCN1, RBP and ApoD, may be involved in 
the transport of fatty acids and in the biological 
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mechanisms for detecting the fatty acid load of the 
food intake, in particular in the buccal region. The 
invention therefore also relates to the use of the 
polypeptides above, in combination with fatty acids, 
for decreasing the consumption of fatty acids, in 
particular in hyper lipidemi as or in obesity. The poly- 
peptides above may therefore be used for treating 
hyperlipidemias and obesity. Specifically, these 
proteins contribute to the detection of the fatty acid 
content in the food taken in (Gilbertson, 1998) ; an 
excess of these proteins should lead to the physio- 
logical system which detects the fatty load of a food 
intake being deceived. Thus, a food portion which is 
low in fat but supplemented with OBP, ApoD, RBP or LCNl 
pre-loaded with fatty acid will be falsely identified 
as being rich in fat. 

Another application is to supplement a nonmaternal milk 
with one or more of the polypeptides described above. 

The OBPs according to the present invention may be used 
in preventive and curative therapy. 

Thus, a lack of detection of this type of protein in a 
biological specimen using an antibody, a primer and/or 
a probe according to the invention may be an element 
for the diagnosis of anosmia. 

Similarly, it is possible to envision the detection of 
anti-OBPII antibodies in a biological sample of human 
origin; the presence or the titer of these antibodies 
may be related to certain types of allergy, in 
particular in asthmatics. In this case, it may be 
possible to treat this type of allergy by administering 
polypeptide fragments as described, which would 
decrease the immune reactions to the external 
allergies . 
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The invention therefore relates to a process for 
detecting antibodies against human OBPIIs (hOBPIIs) in 
human serum from an allergic and /or asthmatic patient 
using an hOBPII polypeptide according to the invention. 

According to another embodiment, the invention relates 
to a process for detecting antibodies against human 
OBPIIs (hOBPIIs) in a biological fluid from a patient 
suffering from a cancer, and in particular from 
prostate cancer and/or from breast cancer and/or from 
uterine cancer and/or from ovarian cancer and/or from 
lung cancer. Specifically, the proteins according to 
the invention are overexpressed in tumors, and in 
particular in prostate cancer {US 58043 68, WO 97 10503, 
CS 84 02898, CS 83 02012, CS 82 08506, CS 82 08215), 
breast cancer (Stoecz and Gould, 1995; Simard et al . , 
1992), uterine cancer, ovarian cancer and lung cancer. 

The OBPs according to the present invention may also be 
used in pharmaceutical compositions, this being in 
particular in order to deliver certain drugs. 

In fact, lipocalins are used by mammals to transport 
hydrophobic molecules in biological fluids. It even 
appears that they are the natural transporters of 
xenobiotics in vivo. For example, an experimental over- 
load of xenobiotics creates, in male rats, tumors in 
the proximal convoluted tube (PCT) of the nephron 
(Borghoff et al . , 1990). Specifically, the MUP proteins, 
only produced in male rats, are reabsorbed by the PCT 
cells; after lysosomal degradation, the MUP proteins 
release the xenobiotics which they were transporting. 
These xenobiotics accumulate in these cells and lead 
them toward a tumor pathway by mutagenesis. Lipocalins 
therefore appear to be the ideal structure to trap, in 
their calyx, a molecule used as a medicinal product, 
thus avoiding a general distribution thereof. Thus, 
G. Beste et al . (1999) and application WO 99 16873 
describe a strategy of mutagenesis and screening 



variants of a lipocalin having optimum affinity for a 
given ligand. It is therefore possible to create a 
highly specific "medicinal product cage" . 

The present invention therefore relates to a polypeptide 
according to the invention, as an agent for targeting a 
pharmaceutical compound. The expression "agent for 
targeting" is intended to denote the polypeptides of 
the invention capable of transporting and releasing the 
ligand to which they are bound, at the level of certain 
target tissues or cells which have, at their surface, 
receptors for said polypeptides. Thus, the invention 
relates to the transport of a medicinal product in the 
cage of a polypeptide according to the invention, and 
targeting cells through the ability of the OBPII to 
bind to a specific receptor. 

It is also within the scope of the invention to develop 
fusion proteins or chimeric proteins comprising the 
cage corresponding to the lipocalins according to the 
invention combined with a protein which allows specific 
cellular addressing, generally different from that 
which is natural for the OBPIIs. A subject of the 
present invention is therefore a polypeptide, charac- 
terized in that said polypeptide is expressed in the 
form of a protein from fusion with a protein which 
allows specific cellular addressing. Among the proteins 
which allow specific cellular addressing, mention 
should be made of interleukins , of cytokines, of 
lymphokines, of chemokines, of growth factors, of 
hormones, and of monoclonal or polyclonal antibodies. 
The interleukins, cytokines and lymphokines are chosen 
from a group preferably composed of the interleukins 
II-l to 11-20, and the interferons oc-IFN, P-IFN and 
Y-IFN. The growth factors are preferably chosen from 
the colony stimulating factors G-CSF, GM-CSF, M-CSF, 
and erythropoietin. The hormones are preferably chosen 
from steroid hormones . 



According to another embodiment, the polypeptide 
according to the invention, as an agent for targeting a 
pharmaceutical compound, is combined with a molecule 
which allows specific cellular addressing. Among the 
molecules which allow specific cellular addressing, 
mention should be made of the group of steroids, of 
interleukins , of cytokines, of lymphokines , of 
interferons, of growth factors, of hormones and of 
antibodies. The molecule is preferably a steroid. The 
combination of the polypeptide according to the 
invention and said molecule is produced either via a 
noncovalent attachment, using for example the avidin- 
biotin system, or via a covalent attachment, using for 
example chemical bridging agents. 

Among the pharmaceutical compounds which the poly- 
peptide according to the invention may transport and 
target, mention should be made of medicinal products, 
and in particular anticancer agents. The anticancer 
agents are selected from the group of antiproliferative, 
antineoplastic or cytotoxic agents, and are used to 
arrest the development of cancers and to induce 
regression and/or elimination of the tumor mass. These 
anticancer agents are preferably radioisotopes, and 
even more preferably gamma -ray- emit ting radioisotopes, 
such as iodine 131 , yttrium 90 , gold 199 , palladium 100 , 
copper 67 , bismuth 217 and antimony 211 . Beta- and alpha-ray- 
emitting radioisotopes may also be used for the 
therapy. The non-isotopic anticancer agents bound to 
the polypeptide according to the invention are many and 
varied; mention may be made of: (i) antimetabolites, 
such as anti folate agents, methotrexate, (ii) purine 
and pyrimidine analogs (mercaptopurine, f luorouracil , 
5-azacytidine) , (iii) antibiotics, (iv) lectins (ricin, 
abrin) and (iv) bacterial toxins (diphtheria toxin) ; 
the toxins are preferably chosen from Pseudomonas 
exotoxin A, diphtheria toxin, cholera toxin, Bacillus 
anthrox toxin. Pertussis toxin. Shigella Shiga toxin, 
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Shiga-like toxin, Escherichia coli toxins, colicin A, 
d-endotoxin and Haemophilus A hemagglutinin. 

The invention also relates to a pharmaceutical composi- 
5 tion comprising a pharmaceutical compound bound to at 
least one polypeptide according to the invention, and a 
pharmaceutically acceptable vehicle. It is also within 
the scope of the invention to increase the binding 
capacity by mutagenesis of this same protein. The 

10 invention also relates to a pharmaceutical composition 
as defined above, for treating cancer, and in 
particular prostate cancer, breast cancer, uterine 
cancer, ovarian cancer, liver cancer and pulmonary 
epithelial cell carcinoma. Given that the OBP named 

15 hOBPIIb, apolipoprotein D, RBP and OC-l-acid 
glycoprotein are expressed in the prostate, the 
pharmaceutical composition according to the invention 
is intended for the treatment of prostate cancer. 
Similarly, given that it has been demonstrated that the 

20 mRNA of the ApoD, RBP and LCNl proteins is produced in 
the mammary gland, in addition to the production of 
hOBPIIb already described in application WO 99 07740, 
the pharmaceutical composition according to the 
invention is intended for the treatment of breast 

25 cancer. 

The invention also relates to a polypeptide according 
to the invention, as a pharmaceutical compound 
transporter. The term "transporter" is intended to 

3 0 denote polypeptides according to the invention capable 
of vehicling a pharmaceutical compound, in the body, 
without said compound being released at a preferred 
site in the body. Such a polypeptide constitutes a 
means for delivering said pharmaceutical compound in 

3 5 the body. 

The present invention also relates to a pharmaceutical 
composition according to the invention, characterized 
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in that said polypeptide constitutes a delayed form of 
delivery of said pharmaceutical compound in the body. 

Specifically, the pharmaceutical compound of interest 
bound to the OBPII polypeptide according to the 
invention gradually diffuses in the body as said 
transporter polypeptide is catabolized in the body. The 
use of recombinant DNA techniques allows those skilled 
in the art to modify the half-life duration of the 
OBPII polypeptide in the body by introducing 
modifications into said polypeptide. Thus, it may be 
advantageous to develop a homologous polypeptide 
according to the invention, for which the protease 
cleavage sites have been mutated in order to increase 
the half -life of said polypeptide in the body. It may 
also be advantageous to develop a multimeric poly- 
peptide, expressed for example in the form of a fusion 
protein, in order to avoid a "glomerular escape" (renal 
clearance) and thus to increase the half-life of said 
polypeptide in the body. 

According to another aspect, the invention relates to a 
compound characterized in that it is chosen from an 
antibody, a polypeptide, a ligand, a polynucleotide, an 
25 oligonucleotide and a vector according to the 
invention, as a medicinal product, and in particular as 
active principles of a medicinal product; these 
compounds will preferentially be in soluble form, and 
combined with a pharmaceutical ly acceptable vehicle. 
30 The term "pharmaceutically acceptable vehicle" is 
intended to denote any type of vehicle conventionally 
used in the preparation of injectable compositions, 
i.e. a diluent and/or a suspending agent, such as an 
isotonic or buffered saline solution. These compounds 
35 will preferably be administered systemically , in 
particular intravenously, intramuscularly, intradermally 
or orally. Their optimal methods of administration, 
doses and pharmaceutical forms may be determined 
according to the criteria generally taken into account 



10 



15 



in establishing a treatment suitable for a patient, 
such as for example the age or bodyweight of the 
patient, the seriousness of his or her general 
condition, the tolerance to the treatment and the side 
effects observed, etc. 

The invention also relates to a pharmaceutical 
composition comprising an expression vector according 
to the invention and a pharmaceutically acceptable 
vehicle . 

It is also possible to use the property of expression 
in particular sites of these proteins in order to 
provide imaging and/or a diagnosis of certain types of 
cancer . 

Finally, given the high expression of the polypeptides 
according to the invention in the placenta (cf. 
figure 10) , the present invention provides various 
therapeutic applications intended to treat pathological 
conditions of gestation in women. 

Thus, one of the objects of the present invention is to 
use the polypeptide according to the invention for 
preparing a medicinal product intended to transport 
compounds across the placental barrier. These compounds 
are preferably lipophilic molecules. The polypeptide 
according to the invention may be used to transport 
compounds from the gestating mother to the fetus; the 
compounds possibly being chosen from the molecules of 
therapeutic interest, which are preferably lipophilic, 
essential fatty acids, hormones, vitamins and, more 
generally, small lipophilic molecules. 

In fact, it has been demonstrated that the cells of the 
placental villi secrete massive amounts of the poly- 
peptide according to the invention, in particular the 
OBPIIs. These polypeptides then spread through the 
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fetal blood and to the general blood circulation of the 
fetus . 

In addition, the polypeptide according to the invention 
5 may be used to transport a compound from the fetus to 
the gestating mother. Specifically, the fetus secretes, 
via the villi of the nasal structures, polypeptides 
according to the invention which participate in 
molecule exchanges between the fetus and the mother at 

10 the amniotic membrane. An object of the present 
invention is therefore also to provide the fetus with 
polypeptides according to the invention so as to allow 
the transport of compounds present in the fetus, for 
example xenobiotics, to the placenta and/or to the 

15 amniotic membrane, in order to allow the detoxification 
of the fetus. 

According to another aspect of the invention, the poly- 
peptide according to the invention may be used as a 

2 0 pregnancy marker. Specifically, the polypeptide 

according to the invention is present in various 
biological liquids of the mother, such as the urine or 
blood; measuring the concentration thereof and/or the 
presence thereof constitutes a pregnancy marker. In 

25 this capacity, this marker may be used with, or in 
place of, or in addition to the pregnancy marker 
constituted by human choriogonadotrophin, hCG; the 
polypeptide according to the invention and hCG are, in 
fact, molecules which are secreted by the trophoblastic 

30 placental cells. 

Finally, according to another aspect of the invention, 
the polypeptide may be used as a marker for feto- 
placental pathological conditions. It may constitute a 

3 5 marker for the rupture of the amniotic sack membrane. 

Other characteristics and advantages of the invention 
appear in the remainder of the description with the 
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examples and the figures, the legends of which are 
represented below. 

FIGURE 1: Genomic organization of the LCNl/hOBPII locus 

5 

The top line represents the region of chromosome 9q34. 
The double-headed arrow indicates the gap between the 
location of the polymorphic markers D9S1811 and D9S67; 
the relative position of the markers D9S67 and D9S1826 
10 is uncertain. 

The middle level indicates the partial organization of 
the cosmids at 3 different loci: LCNlc, LCNlb-hOBP IIb/ 
LCNl-hOBP IIa . The cosmids (approximately 4 0 kb) are 

15 represented by horizontal lines with their name and the 
arms of the T3 and T7 vectors noted below the line. The 
arrows represent the various genes or pseudogenes with 
their respective orientation. In the cosmids, the 
vertical dotted lines represent the EcoRI sites. The 

2 0 symbol ® indicates an uncertain orientation of the 
locus . 

The bottom level shows the intron/exon structures of 
the LCN1 and hOBPII genes: the black boxes represent 

2 5 the exons; the arrows represent the transcription 

initiation site; oil to ol5 represent the oligonucleo- 
tide probes used to screen the cosmids; Alu represents 
the presence of repeat sequences . 

3 0 FIGURE 2: Dot Plot Analysis of: 

(A) sequence of the LCNl-hOBPIIa locus (AC000396+ACXXXXX) 
versus the LCNlb-hOBPIIb locus (AC002098); 

(B) sequence of the LCNl-hOBPIIa locus (AC000396+ACXXXXX) 
versus the sequence of the LCNlc locus; 

3 5 (C) sequence of the LCNlb-hOBPIIb locus (AC002098) 
versus the sequence of the LCNlc locus (AC002106) . 

Genomic sequences were filtered for the repetitive 
sequences using "Repeatmasker " (REF) , and then compared 
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and dotplotted with the program gcg using a window size 
of 25, and a criterion of 2 0 with an 80% homology. 

FIGURE 3 : Nucleotide sequence of hOBPII genes 

5 

The upper lines represent the hOBPIIa sequence and the 
lower lines represent the hOBPIIb sequence for which 
only the different nucleotides are represented; a dash 
indicates the absence of corresponding sequences. The 

10 shaded capitals are the exon sequences and the lower 
case letters are the intron sequences. The sizes 
indicated on the left are indicated in bp. The TATA box 
is in bold characters and the polyadenylation signal is 
underlined. The boxes indicate the splice acceptor 

15 sites for exons 5, 5b and 5c. 

FIGURE 4: Diagrammatic represent at ion of the two hOBPII 
genes and of their corresponding mRNAs 

2 0 The horizontal lines represent the exon/intron 

organization with sizes indicated in bp. The shaded 
boxes numbered from 1 to 7 are the coding exon 
sequences of the main transcripts; the letters b and c 
refer to supernumerary exons. The various transcripts 
25 are represented using assembled boxes: the first, 
hOBPIIaa and hOBPIIba, correspond to the main 
transcripts, the others correspond to the forms derived 
from alternate splicing. X> indicates a shift in the 
reading frame resulting from the insertion or from the 

3 0 deletion of an exon and * represents a stop codon . The 

letter "a" represents an a-helix and "b" (3-sheets 
predicted by the DSC program. The letters in italics 
are predictions obtained with the "Predator" program. 

35 FIGURE 5: Alignment of the protein sequences derived 
from the two human genes hOBPIIa and hOBPIIb (hOBPIIaa 
= OBP2aaHOMS, hOBPIIba = OBP2baHOMSA, hOBPIIbfJ 
OPB2bbHOMSA, hOBPIIay = OBP 2 agHOMSA , hOBPIIap 
OBP2abHOMSA) , of human tear lipocalins (LCNl_HOMSA) , of 
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rat OBPXI (OBP2_RATNO) , of bovine lactoglobulin BLG 
( LACB BOSTA ) , of mouse MOP (MUP6 MUSMU) , of human RBP 
(RBP_HOMSA) , of bovine OBP (OBP_BOSTA) , of rat MUP 
(MUP__RATNO) and of porcine OBP (OBP_SUSSC) . 

5 

The residues in the dark gray boxes are identical and 
those in the light gray boxes are similar. The elements 
of secondary structure predicted with the DSC program 
are underlined and the amino acid residues are in 

10 italics. The ^-sheets and the cc-helices are numbered 
for hOPBIIa and b. The anticipated signal peptide 
cleavage site is indicated with an arrow (AAaIlS ) at 
position 15 . Nonaligned sequences of very divergent 
forms of spliced genes, hOPBa8 ( OPB2 adHOMSA ) and hOBPby 

15 (OBP2bgHOMSA) have been added at the bottom of the 
alignment after the analysis. 

FIGURE 6 i RT-PCR analysis 

20 (A) Detection of the LCNl , LCNlb and LCNlc RT-PCR 
products with their specific probes, and (B) detection 
of the hOBPIIa and hOBPIIb RT-PCR products with their 
specific probes. (C) Control G3PDH RNA. The sizes are 
indicated in bp. 

25 

FIGURE 7: Tissue location of the hOBP mRNAs 

Sections of the middle meatus (A, B) , of the conchae 
(C, D) , of the prostate (E, F) , of the vas deferens (G, 

3 0 H) , and of the mammary glands (I, J) were hybridized 
with digoxigenin-ll-UTP-labeled hOBP riboprobes . The 
hybridization with a sense hOBP riboprobe did not 
reveal any signal (B, D, F, H, J) . A specific 
hybridization signal is obtained with an antisense 

3 5 riboprobe (A, C, E, G, I) . The arrows indicate 
different structures: AC: acinar cells, EC: epithelial 
cells, GC: glandular cells, SD: secretory duct and L: 
lumen (X2 00) . 
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FIGURE 8 : Phylogenetic distance tree for vertebrate 
lipocalins 

The usual abbreviations for lipocalins were used. 
5 Indicated after the symbol "_" are the first three 
letters of the genus followed by the first two letters 
of the species. 

FIGURE 9: Scheme of the evolution of the LCN1-OBPII 
10 gene subfamily during evolution 

The boxes indicate the exons arranged on a line 
representing the genomic DNA. The "//" between the 
lines indicate that the loci are not consecutive, 
15 without it being possible to determine the order. The 
large cross under the symbol "?" illustrates a partial 
duplication or complete duplication event with a 
subsequent genomic deletion for the LCNlc locus (choice 
indicated by the symbol "*"). The smallest cross 

2 0 indicates the removal of the seventh exon which appears 

to be specific for humans because of the numerous 
repeat sequences Alu present in this genomic region and 
because the rat has two VEGP genes. The symbol "**" 
marks the exon recruitment step based on the data 
25 present and those of the literature; this step may have 
appeared at any time after the duplications of the loci 
and may have been sequential . 

FIGURE 10: RT-PCR analysis, in the oral sphere and the 

3 0 genital sphere, of the expression of the other known 

human lipocalins 

Definition of novel OBP and VEGP functions for the RBP 
(retinol-binding protein) and ApoD (apolipoprotein D) 
35 proteins. 
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FIGURE 11: Study of the polyclonal antibody against the 
hOBPXX proteins 

The fusion protein is loaded in wells 1 and 3. A sample 
of human nasal mucus is loaded in wells 2 and 4. The 
proteins contained in wells 1 and 2 were revealed with 
Coomassie blue staining. The antibody hybridized on the 
membrane obtained by Western blotting specifically 
reveals the GST-hOBPIIb protein between 3 0 and 42 kDa 
(well 3) and the 18 kDa hOBPIIa and hOBPIIb (well 4) in 
the human nasal mucus . 

FIGURE 12: Immunohistochemistry on the tissues of the 
olfactory apparatus 

Strong immunoreactivity can be observed on the section 
of septum (green staining, A) on the two sections of 
concha (C and D) and on the section of middle meatus 
(F) . The negative controls prepared for the septum (B) , 
for the concha (E) and for the middle meatus (G) show 
no reactivity. The cell nuclei are located by DAPI 
staining (blue staining) . The magnifications used are 
X 100 for the sections A, B, C, E and G, and X 200 for 
the sections D and F. 

FIGURE 13 : Immunohistochemistry on tissues of the oral 
sphere 

These two tissues, lacrymal glands (A, B) and von 
Ebner's glands (E, F) show strong immunoreactivity 
compared to the results obtained with the pre-immune 
serum (C for the lacrymal glands and D for the von 
Ebner's glands). Magnification X 200. 

FIGURE 14 : Immunohistochemistry on the mammary glands 
and lung 

Strong immunoreactivity is visible on the sections of 
mammary glands (A and B) compared to the negative 



control (C) prepared using pre -immune serum. Strong 
immunoreactivity is also observed on the sections of 
lung (E and F) , again compared to the negative control 
(D) . Each observation is made at a magnification of 
X 100 for A and C and at a magnification of X 200 for 
B, D, E and F. 

FIGURE 15: Demonstration of the hOBPII proteins in 
various nasal structures: concha (A: 2 0 Ox, B: 400x) , 
septum (C: 300x, D: 600x), middle meatus (E: 400x, F: 
600x) 

Detection of the hOBPII proteins with a rabbit 
polyclonal serum revealed by peroxidase activity 
detected by DAB. AC: secretory acinar cell, P: DAB 
precipitate (hOBPII detection) . 

FIGURE 16: Demonstration of the LCN1 protein in various 
nasal structures: concha (A: 200x, B : 500x) , septum (C: 
lOOx, D: 500x) , middle meatus (E: 200x, F: 600x) 

Detection of the LCN1 protein with a rabbit polyclonal 
serum revealed by peroxidase activity detected by DAB. 
AC: secretory acinar cells, P: DAB precipitate (LCN1 
detection) . 

FIGURE 17 : Demonstration of the ApoD protein in various 
nasal structures: concha (A: 500x, B: 700x) , septum (C: 
300x) , middle meatus (D: 400x) 

Detection of the ApoD protein with a mouse monoclonal 
antibody revealed by peroxidase activity detected by 
DAB. AC: secretory acinar cell, P: DAB precipitate 
(ApoD detection) . 
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FIGURE 18: Demonstration of the hOBPII protein in 
various structures of the oral sphere: lacrymal gland 
(A: 20 Ox, B: 600x) , von Ebner's glands (C: 100x, D: 
400x), lung (E: 200x, 800x) 

Detection of the hOBPII protein with a rabbit poly- 
clonal serum revealed by peroxidase activity detected 
by DAB. AC: secretory acinar cell, P: DAB precipitate 
(hOBPII detection) . 

FIGURE 19: Demonstration of the hOBPII proteins in 
various placental structures: chorionic villi (A: 300x, 
B: 800x), amniotic membrane (C: 800x), cord (D: 600x) 

Detection of the hOBPII proteins with a rabbit poly- 
clonal serum revealed by peroxidase activity detected 
by DAB. L: lacuna (contains maternal blood), C: fetal 
capillary, Am: amniotic membrane, E: epithelial cells 
derived from the extra-embryonic ectoblast, MB: 
basement membrane, M: mesenchymal layer, P: DAB 
precipitate (hOBPII detection) . 

FIGURE 20: Demonstration of the LCN1 protein in various 
placental structures: chorionic villi (A: 200x # B : 
80 Ox) , amniotic membrane (C: 600x) , cord (D: 50 Ox) 

Detection of the LCN1 protein with a rabbit polyclonal 
serum revealed by peroxidase activity detected by DAB. 
L: lacuna (contains maternal blood), C: fetal capillary, 
Am: amniotic membrane, E: epithelial cells derived from 
the extra-embryonic ectoblast, MB: basement membrane, 
M: mesenchymal layer, P: DAB precipitate (LCNl 
detection) . 
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FIGURE 21s Demonstration of the ApoD protein in various 
placental structures; chorionic villi (As lOOx, Bs 
500x) , amniotic membrane (Cs 400x) , cord (Ds 400x) 

5 Detection of the ApoD protein with a mouse monoclonal 
antibody revealed by peroxidase activity detected by 
DAB. L: lacuna (contains maternal blood), C: fetal 
capillary, Am: amniotic membrane, E: epithelial cells 
derived from the extra- embryonic ectoblast, MB: basement 
10 membrane, M: mesenchymal layer, P: DAB precipitate 
(ApoD detection) . 

EXAMPLES 

15 EXAMPLE Is MATERIALS AND METHODS 

1 .A. Human chromosome 9 cosmid library 

A copy of the library of cosmids specific for human 
20 chromosome 9, LL09NC01P, constructed by Dr. J. Allmeman 
(Biochemical Sciences Division, Lawrence Livermore 
National Library, Livermore, CA USA) under the aegis of 
the National Gene Library Project, supported financially 
by the American Department of Energy, was used. The 
25 library was screened and the clones analyzed as 
previously described (Lacazette et al . , 1997). 

1 . B . Cloning and sequence analysis 

30 A Lambda gtll human testes cDNA library (Clontech) (10 7 
p.f.u.) was amplified by 3 0 polymerase chain reaction 
(PCR) cycles (94°C 45 sec, 54°C 45 sec, 72°C 1 min 
3 0 sec) with the primer oliEST58 CCTGCAGGTACATGAGCTTCC 
and 5 ' or 3 ' screening amplimers for inserts located on 

35 the arms of the lambda gtll vectors. A nested PCR was 
then carried out with oliEST2 6 CGCTGTATTTGCCAGGCTCC and 
oligonucleotides specific for the arm of the vector. 
The PGR products were subcloned into the pGEM-T(r) 
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vector, which made it possible to obtain the 5' end of 
the hOBPII gene cDNAs . 



The sequences obtained using the standard oligonucleo- 
5 tide pGEM-T(r) and a dye terminator-based sequencing 
ready reaction mix (Applied Biosystems) were separated 
by electrophoresis using an ABI PRISM 377 automatic 
sequencer (Perkin Elmer) and were then analyzed with 
the sequence nagivator 1.0.1. software (Perkin Elmer). 

10 Full length cDNA clones of hOBPIIa (hOBPIIaa, hOBPIIap, 
hOBPIIa8, hOBPIIay) and hOBPIIb (hOBPIIba, hOBPIIbp, 
hOBPIIby) were obtained from the RT-PCR by purifying 
the bands of interest according to the manufacturer's 
instructions (Qiagen gel extraction kit) or, for the 

15 weakly expressed alternative forms, by subcloning the 
nested PCR products into a pGEM-T(r) vector. 

l.C. RT-PCR analysis 

2 0 Tissue samples were collected from 45 to 55 year old 

Caucasian individuals, in agreement with the French 
regulations in force. The total RNA is extracted 
according to a single-step method using the RNA NOW® 
reagent, according to the manufacturer's (Biogentex) 
25 instructions. 5 fig of total RNA were reverse-transcribed 
in a final volume of 2 0 fxl using 0.5 ng of oligo- 

Superscript® preamplif ication system (Gibco BRL) . 
Three jxl of these reactions were then used for the 
30 following PCRs . The expression of the specific mRNAs 
was determined by PCR using: the primers TL: 
CCTCTCCCAGCCCCAGCAAG and AP : G AC TCG AGTC G AC ATC G for the 
LCNl-type genes (LCN1, LCNlb, LCNlc) and, for the 
hOBPII-type genes, the primers DE : 

3 5 CGCCCAGTGACCTGCCGAGGTC , and FI : 

CTTTATTTGGAGTCAGGTGGGTG . As a quality control for the 
RNAs the primers G3PDH1: CTCTGCCCCCTCTGCTGATG and 
G3PDH2: CCTGCTTCACCACCTTCTTG for the G3PDH gene were 
used; the G3PDH gene is considered to be consti tutively 
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expressed in all cell types. 32 PCR cycles (94°C 
45 sec, 54°C 45 sec, 72°C 2 min 30 sec) were performed 
and the amplification products were separated on 1% 
agarose gel . The DNA was transferred onto a Hybond N+® 
5 membrane . 

For the detection of -the expression of the various 
genes, several oligonucleotides specific for the 
respective genes were synthesized: 

10 

- olLCNl: GACTCAGACTCCGGAGATGA , 

- olLCNlb: AACTCAGACACCAGAGATGA, 

- olLCNlc: GACTCAGATCCCGGAGATGA, 

- ol5: CCAGGAGGGACCACTACA specific for the hOBPIIb gene, 
15 - ol4: CCGGGACGGACGACTACG specific for the hOBPIIa gene, 

- G3PDH3: CTCATGACCACAGTCCATGC . 

The oligonucleotides are labeled with y 32 P-ATP using T4 
kinase (Applied Biosystem) ; the labeled oligonucleotide 

2 0 hybridizations are performed at 42 °C. The final wash is 
performed in a solution of 0.1 X SSC, 0.1% SDS at 48°C 
for 20 min. The specificity of the oligonucleotide 
hybridization reactions is controlled using samples of 
digested cosmid DNA (p233G2 for LCNl and hOBPIIa, P19E7 

25 for LCNlb and hOBPIIb, and P181A9 for LCNlc) loaded 
onto the gel with RT-PCR products. 

l.D. Genotyping and linkage analysis 

30 The genotyping is carried out with PCR reactions using 
100 ng of genomic DNA originating from 8 CEPH reference 
families, and using the oligonucleotides oli9 
TGTTCGGGAACGC AGC TT and olilOa TGCCGCTGTCCCCACGTCGG . The 
thermocycler parameters consist of an initial cycle at 

35 94°C for 10 min followed by 30 cycles at 94°C for 
30 sec, 55°C for 30 sec and 70°C for 45 sec, and then a 
final elongation step of 10 min at 70°C. The PCR 
products are then analyzed on a 3% agarose gel. The 
information regarding the markers for chromosome 9 may 
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be obtained at the following internet address 
(hTTP : / /gal ton . ucl . ac . uk) ; the analyses were carried 
out using the linkage study tools previously described 
in Lacazette et al . (1997) . The haplotypes are 
5 reconstructed manually according to the recombination 
events previously described in family 1362 (Attwood et 
al. , 1994) . 

I.E. Secondary structure predictions 

10 

A multiple alignment of the lipocalin proteins for 
which the crystallographic structures have already been 
described (Monaco et al . , 1992; Spinelli et al . , 1998), 
with the hOBPIIa and hOBPIIb proteins, was obtained 

15 using the Clustalx software (ftp.infobiogen.fr). The 
putative secondary structures were determined with the 
DSC (Discrimination of protein Secondary structure 
Class) program developed by R.D. King and M.J.E. 
Sternberg (http: //bioweb.pasteur . fr/seqanal/ inter faces/ 

20 dsc-simple.html) . The secondary structures of the 
proteins corresponding to the alternatively spliced 
forms are presumed to be identical to the conventional 
forms before the reading frame shift; after the shift, 
the structural prediction is made with a single 

2 5 sequence and is performed using the Predator software 

(http://pbil.ibcp.fr/cgi-bin) . 

l.F. Phylogenetic analysis 

3 0 The lipocalin proteins, the sequences of which are 

entirely known, were aligned three times consecutively 
using the Clustalx software (ftp://ftp.infobiogen.fr). 
Distances in the phylogenetic tree were calculated with 
Clustalx and plotted with Njplot. 

35 

l.G. In situ hybridization 



Serial sections cut on a cryostat (8 i*m thick) are 
collected on SuperFrost® Plus slides (Menzel Glazer) 
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and stored at -80°C. Antisense and sense RNA probes are 
obtained according to standard techniques using T7 or 
SP6 polymerase, from a matrix obtained by digestion of 
the phOBPIIaP2 cDNA clone with Ncol or PstI restriction 
5 enzymes, and using DIG-11-UTP (Boehringer Mannheim) 
(the probe is approximately 150 nucleotides long) . The 
matrices digested with PstI and transcribed with T7 RNA 
polymerase correspond to the antisense probe and the 
matrices digested with Ncol and transcribed with the 

10 SP6 RNA polymerase correspond to the sense probe. 
Tissue sections are fixed in 4% paraformaldehyde for 
15 min and then rinsed for 5 min in 2 X PBS. After 
acetylation (2x5 min in a triethanolamine (TEA) 
buffer, pH 8, containing 0.25%, v/v, of acetic 

15 anhydride) , the tissue sections are prehybridized at 
60°C for 15 min in 50% formamide/1 X SSC . The labeled 
probes are applied to each of the sections in 50 pil of 
hybridization buffer (50% formamide, 1 X Denhardt's, 
500 ng/ml total tRNA, 10% Dextran sulfate, 10 mM 

2 0 di thiothrei tol ) . The sections are covered and then 
incubated in humid chambers at 50°C overnight. After 
hybridization, the slides are immersed at 55°C in a 
washing buffer (50% formamide, 1 X SSC) for 2 hours. 
Each slide is then rinsed twice 5 min in 2 X SSC at 

25 ambient temperature, and then treated for 30 min with 
10 mg/ml of RNAse at 37°C and, finally, immersed in a 
washing solution (50% formamide, 2 X SSC) for 2 hours 
at 55°C. The slides are then placed in 0.1 X SSC for 
15 min at 55°C. The immunodetection is performed using 

30 an alkaline phosphatase-con jugated anti-DIG antibody 
(Fab fragments) according to the Boehringer Mannheim 
protocol. The sections are examined at different 
magnifications using an Axiophot microscope (Zeiss) . 

35 l.H. Production of a monoclonal antibody against the 
hOPBn proteins 



Two hundred and fifty [il of fusion proteins were given 
to the company Agro-Bio for the production of 
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antibodies. Rabbits (New Zealand White SPF) are injected 
on days 0, 14, 2 8 and 42. The sera are taken on days 0, 
35, 49 and 63. In order to test the antibody, the 
fusion protein is loaded in lanes 1 and 3 (figure 11), 
while human nasal mucus is loaded in lanes 2 and 4. 
Lanes 1 and 2, and also the lane corresponding to the 
molecular weight ladder, correspond to revelation with 
Coomassie blue. Lanes 3 and 4 correspond to detection 
of the proteins recognized by the antibody using the 
Western blot technique. Lanes 1 and 3 show the presence 
of a series of truncated recombinant proteins between 
30 and 42 kDa derived from a relatively inefficient 
synthesis due to the presence of many codons which are 
rare in the bacterium, within the hOBPIIb sequence. 
However, this production of recombinant protein was 
sufficient to produce a polyclonal antibody of good 
quality, as indicated by the revelation, in lane 4 
corresponding to the nasal mucus, of an 18 kDa band 
specific for the hOBPn proteins. 

1 . I . Immunohi stochemi stry 

1.1.1. 8 |Wm serial sections of various tissues are 
fixed with paraformaldehyde (5 min) and rinsed three 
times with 1 X PBS (15 min) then incubated for 30 min 
in a 3% solution of BSA in 1 X PBS, before being 
incubated overnight in the presence of the anti- fusion 
protein antibody in a 1 X PBS solution. After three 
rinses in 1 X PBS (15 min) , a rabbit anti-IgG antibody 
coupled to FITC (green staining by fluorescence) in a 
1 X PBS solution is placed in contact with the slides 
for 3 hours. After rinsing in 1 X PBS, a solution of 
DAPI (100 ng/fil in 1 X PBS) is applied for 10 min 
(counterstaining of the cell nuclei in blue) . After 
three rinses in 1 X PBS (15 min) the slides are mounted 
in water-glycerol (50/50). The analysis is carried out 
in the presence of DAPI and FITC filters with the aid 
of a CDD camera using an integration time of between 4 
and 32 ms. 



"X. CI; w 2, "A : ' k-. " «. \ "> ; «r'i; C'Y. t- : '; 



- 45 - 

1.1.2. The tissues are frozen at -80°C in the presence 
of OCT. 7 micrometer serial sections are cut using a 
microtome. The tissues are then fixed with 4% para- 
5 formaldehyde in PBS. The slides are incubated for 
10 min in a solution of methanol in the presence of 
4.5% of H 2 0 2 , and then rinsed three times in PBS for 
5 min. After incubation for 3 0 min in a 3% bovine serum 
albumin solution, the first antibody (diluted to 1/350) 

10 is incubated overnight at ambient temperature (rabbit 
polyclonal antibodies for hOBPII and LCNl, and mouse 
monoclonal antibody for ApoD) . Then, after three 15 min 
rinses in PBS, the second antibody (diluted to 1/300) 
(anti-rabbit for hOBPII and LCNl, and anti-mouse for 

15 ApoD) is incubated for 3 h at ambient temperature. 
After three rinses in PBS for 15 min, peroxidase 
revelation with DAB is carried out for 2 to 10 min, 
using the "Vector" kit. The slides are then rinsed with 
water and counterstained with Mayer's hematoxylin for 

20 2 min. After two rinses with water, the slides are 
dehydrated by passing them successively through baths 
containing 70°, 95°, 100° and 100° ethanol solutions 
and, finally, two baths of toluene. The slides are then 
mounted in Canada balsam and observed using a Zeiss 

25 microscope. 

EXAMPLE 2: IDENTIFICATION OF A GENE HOMOLOGOUS TO LCNl 
LOCATED ON HUMAN CHROMOSOME 9 

3 0 The identification of the LCNl complementary DNA 
encoding the lipocalin from human tears (Lassagne and 
Gachon, 1993) has previously been reported, as has the 
location of the gene on chromosome 9q3 4 (Lassagne et 
al, 1993; Lacazette et al , 1997) . The two genes 

3 5 encoding the von Ebner ' s gland proteins 1 and 2 (Kock 
et al, 1994) correspond to the proteins homologous to 
LCNl in rats; this poses the question of whether other 
genes encoding an LCNl protein exist in the human 
genome. In situ hybridization experiments (Lassagne et 
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al, 1993) and also analyses with somatic hybrids 
(Lassagne et al , 1995) indicate that, if they exist, 
these additional human genes must be located in 
chromosomal region 9q34 . 

A library of cosmids specific for human chromosome 9 
(LL09NP01) was screened with the human LCN1 cDNA probe; 
2 6 cosmids were identified; they were digested with 
EcoRI or PvuII, and then hybridized successively with 
the LCN1 DNA and various oligonucleotides (figure 1) . 

The cosmids are divided into 3 groups. The first group 
(clones P32H3, P41B5, P63B6, P92H20, P109C6, P145H6, 
P195B4, P233G2, P233F2, P265D4 and P276H8) correspond 
to the cosmids containing a sequence of the previously 
isolated LCN1 gene (accession number: L14927) made up 
of 7 exons (Holzfeind and Redl , 1994). The second group 
(clone P19E4, P19E7, P42H9, P98H5 and P142H8) corres- 
ponds to the sequence LCNlb homologous to LCN1 (clone 
P19E4) (accession number: Y10826) from the promoter up 
to the 6th exon, which then diverges. A third group of 
cosmids (clone P110C1, P174E4, P174E5, P181A9, P181B6, 
P211A7, P238C6 and P291E1) contains a region LCNlc, 
established from the partial sequencing of the clone 
P181A9 (accession number Y10827) , which is highly 
homologous to LCNl only from the promoter up to exon 2 . 
Thus, LCNl is the only gene which contains the 7th and 
last exon. In addition, the TATA boxes are degenerate 
in the promoters of the LCNlb and LCNlc genes . 

EXAMPLE 3: IDENTIFICATION OF GENOMIC DUPLICATIONS 
CONTAINING THE LIPOCALIN GENES AND MAPPING 
OF CHROMOSOME REGION 9Q34 

During the period in time over which the LCNl gene 
family was identified, a vast project of physical 
mapping led to the identification of cosmid contigs 
partially covering chromosome region 9q34 (Nahmias et 
al, 1994, Van Slegtenhorst et al . , 1995, Hornigold et 
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al, 1997); the sequencing of the corresponding clones 
was performed by a group from the Massachusetts 
Institute of Technology (Boston, USA) . 

5 Analyses comparing sequences in the database, for LCN1 , 
LCNlb and LCNlc, revealed strong homologies between the 
previously described LCNl gene (accession number 
L14927) and the three cosmid sequences: P161A1 having 
the accession number AC002098, P203H12 with the 
10 accession number AC000396 and P161G2 with the accession 
number AC002106. 

A very fine analysis, in particular of the 3' region of 
the LCNl genes, revealed that the LCNlc sequence 

15 corresponded to the clone having the accession number 
AC002106, LCNlb to the number AC002 098 and LCNl to the 
number ACO 00396; there is, however, an insertion of 
60 base pairs at position 123 60 of the clone having the 
accession number AC000396 compared to that having the 

20 number L14927. Additional analyses revealed that these 
sequences are homologous over a region which is much 
larger than the LCNl genes. As illustrated by the dot 
plot analyses (figure 2), these three cosmids corres- 
pond to duplicated genomic regions; the cosmids P161A1 

25 (AC002098) and P203H12 (AC000396) are homologous over 
their entire sequences; the cosmid P161G2 (AC002106) is 
homologous to the sequence only for sequences located 
upstream of intron 3 of the LCNl gene. 

30 The relative positions of these duplicated regions of 
human chromosome 9 were analyzed. The results were 
compared with the physical map established in the 
context of the physical mapping project by restriction 
analyses of the cosmids (Hornigold et al , 1997) using 

3 5 the same library LLN09NC01P. Approximately the same 
cosmids were found in the two searches for the LCNlc 
locus; the sequence comparison for the cosmid P181A9 
reveals that its T3 end contains the Surf 5 gene 
(figure 1) . In addition, AC002106 is located in a 
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sequence contig between ABO and the Surfeit locus, 
confirming the location of LCNlc . P161A1 and P203H12 
have less precise locations using the data from 
Hornigold and from his collaborators because of the 
5 limit of the restriction mapping strategy for the 
duplicated regions. P203H12 (AC000396) appears to 
correspond to the LCN1 gene which was previously 
located close to the D9S1826 marker (Lacazette et al . , 
1997), with the exception of a 60 base pair insertion. 

10 

In order to test whether P2 03H12 may correspond to a 
4th duplicated region, the region was tested by PCR on 
genomic DNA from 150 unrelated individuals using 
primers specific for AC000396 and L14927 . A single band 

15 corresponding to the length of the AC000396 sequence 
was detected (data not shown) , thus proving that 
P2 03H12 contains the previously described LCNl gene. 
The LCNlb locus was not positioned precisely in the 
region, as demonstrated by the fact that the AC002098 

20 ends and also the ends of the cosmids containing LCNlb 
detect no homologous sequence in the databanks. 

The sequence analysis made it possible to identify a 
putative minisatellite at position 3177-3724 of the 
25 cosmid corresponding to AC002098 (figure 1); a rare 
polymorphism was thus demonstrated in a population of 
2 0 unrelated individuals (PIC = 0.05) . This new poly- 
morphic marker is informative only in family 13 62 of 
the 8 CEPH reference families tested. 

30 

The linkage analysis revealed two-point lod scores 
greater than 3 at theta = 0 for the D9S275 and D9S1818 
markers. The reconstruction of the haplotype confirmed 
the location of the LCNlb gene between the D9S1811 and 
35 D9S67 markers in chromosome 9q34 (figure 1). 

EXAMPLE 4: IDENTIFICATION OF TWO NEW LIPOCALIN GENES 

The sequence analyses revealed new data. 
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The comparison of AC002098 to the databank revealed 
similarities between this sequence and the lipocalin 
genes. In addition to the region 21000 to 27000, which 
5 contains the LCNlb gene, the region around position 
2150 contains sequences which exhibit similarities with 
the sequences encoding the rat type II odorant-binding 
proteins, lipocalins from tears and also the EST 
AA460385 . 

10 

In parallel, the comparison of the AC0003 96 sequence to 
the databank revealed, in addition to the region 11100 
to 17100 containing the LCNl gene, similarities for the 
same group of sequences in the region 36600 to 37800. 

15 

The EST AA460385 expressed in human testes corresponds 
to 4 exons of the new lipocalin gene present in the 
clone P161A1 (AC002098) . A putative 50 base pair exon 
similar to the EST sequence is also present at the end 
20 of the clone P203H12 (AC000396) . 

The inventors have therefore concluded that a new 
lipocalin gene exists at a position distal to LCNlb. 
The inventors have put forward the hypothesis that, 
25 subsequent to a genomic duplication, a second gene 
orthologous to the gene encoding the EST is present in 
the 3 ' region of LCNl . 

The sequencing of the cosmid P233G2 containing this 
3 0 region (figure 1) with oligonucleotides defined from 
the EST in fact revealed the presence of another lipo- 
calin gene at a locus 20 kb distal to the LCNl gene. In 
order to identify the first exons of the two new genes, 
nested PCRs on cDNA clones originating from a testes 
35 library were carried out between the oligonucleotides 
located in the 5' region of the EST and the arms of the 
vector. The PCR products were cloned and their sequence 
revealed three additional exons going by the genomic 
sequence (figure 1) . A TATA box is present upstream of 
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the first exon in both cases (figure 3) . The two mRNAs , 
hOBPIIa corresponding to the gene located downstream of 
LCN1 and hOBPIIb located downstream of LCNlb (figures 3 
and 4), are 97.5% and 63% identical to those corres- 
ponding to the LCN1 gene. The intron/exon organizations 
of these two genes are consistent with the lipocalin 
family . 

In addition, the deduced protein sequences (170 amino 
acids) confirm that they belong to the lipocalin 
family. The two putative mature proteins hOBPIIa (MW = 
17.8 kDa) and hOBPIIb (MW = 18.0 kDa) are 89% identical. 
These two novel proteins contain a putative signal 
peptide of 15 amino acids (figure 5), the conserved 
residues GXWY at position 27 to 30, and two cysteines 
which may be involved in a disulfide bridge (position 
74 to 166) . 

However, the paired sequence comparisons with the other 
lipocalins indicate weak conservation of the amino acid 
residues (15 to 25%), with the exception of human 
TL/VEG encoded by the LCN1 gene (mean value 43% over 
155 amino acids) and rat OBPII (mean value 45.5%). 
Moreover, the calculated isoelectric points of hOBPIIa 
and of hOBPIIb are 7.85 and 8.72, respectively, whereas 
those of the lipocalins are acid (in the region of 
4.5), with the exception of rat OBPII (PI = 9.01). The 
prediction for the secondary structures of the hOBPIIa 
and hOBPIIb proteins with the DSC program, by carrying 
out multiple alignments with the sequences of lipo- 
calins of known structures, indicates the presence of 
8 strands of antiparallel (3-sheets which may allow the 
formation of a calyx, followed by an a-helix and a 
final 3-sheet, in agreement with the known data for the 
structures of the lipocalins (figure 5). 
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EXAMPLE 5: STUDY OF THE EXPRESSION OF THE LIPOCALIN 
GENES IDENTIFIED 

In order to specify whether LCNlb, LCNlc, hOBPIIa and 
5 hOBPIIb are expressed genes, RT-PCR analyses were 
carried out on 18 different human tissues known to 
produce lipocalins (figure 6). 

Two pairs of primers were synthesized; one recognizes 

10 the LCNl-type mRNAs and the other the hOBPII-type 
mRNAs . The hybridization of the oligonucleotides 
specific for the genes, on membranes on which the RT- 
PCR products are immobilized after transfer via the 
Southern technique, and then the subcloning of the 

15 hybridized products revealed that LCNlb and LCNlc are 
not expressed in any of the products tested, whereas 
the LCN1 mRNAs are detected in the lacrymal gland, the 
sweat gland, the von Ebner's glands, the nasal septum, 
the epithelium of the nasal concha, as well as the 

20 placenta and mammary glands (figure 6A) . These data and 
the sequence information, which indicates that the TATA 
box and the last exon have been lost, make it possible 
to affirm that LCNlb and LCNlc are pseudogenes which do 
not contribute to the formation of the human LCN1 

25 proteins. 

On the other hand, the hOBPIIa and hOBPIIb genes are 
expressed, which confirms the previous detection 
thereof in the cDNA libraries. Surprisingly, although 

3 0 both proteins hOBPIIa and hOBPIIb are very similar over 
their entire sequence, including in the 1.5 kb promoter 
region, their expression profiles are different 
(figure 6B) . The hOBPIIa protein is strongly expressed 
in the nasal septum, the middle meatus, the nasal 

35 concha, the testes and the placenta, and more weakly in 
the mammary glands, the lacrymal glands, the sweat 
glands, the von Ebner's glands and the lung. On the 
other hand, the hOBPIIb protein is expressed 
predominantly in the prostate, the testes and the 
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mammary glands, and more weakly in the submaxillary 
glands, the nasal septum and the middle meatus. 

In addition, the RT-PCR analyses revealed the existence 
of alternative splicing of the product of transcription 
of the hOBPIIa gene and of the hOBPIIb gene, which 
generates, respectively, four and three mRNAs 
(figures 3, 4, 5 and 6) . 

The transcription of the hOBPIIa gene generates at 
least four mRNAs which encode four different proteins; 
the first mRNA encodes the hOBPIIaa protein which 
corresponds to the hOBPIIa protein described above. In 
the hOBPIIa gene, three different splice acceptor sites 
have been identified for exon 5 (figures 3 and 4) , thus 
forming two other splice variants. A first splice 
variant has an acceptor site for exon 5 located 49 bp 
before the previous one (exon 5b) ; this generates a 
72 5 nucleotide mRNA which encodes the 14 6 amino acid 
protein hOBPIIaP. This protein is identical up to the 
8th putative P-sheet and then different with only 16 
additional amino acids. A second splice variant has an 
acceptor site for exon 5 located 65 bp before the 
previous one (exon 5c) ; this generates a 741 nucleotide 
mRNA which encodes a 228 amino acid protein hOBPIIay. 
This protein has the first eight putative P-sheets 
identical to those of hOBPIIaa, and is then different 
in the C- terminal region (figure 5) due to a reading 
frame shift generated by this alternative splicing 
event; the structure of this C- terminal region 
predicted by the Predator program is a long bent region 
containing a 9th P-sheet. 

In the case of the hOBPIIb gene, in addition to the 
hOBPIIba mRNA previously described, a 106 bp super- 
numerary exon (exon 3b) between the previous exons 3 
and 4 was identified (figure 3). This longer mRNA 
(782 nucleotides) encodes a 165 amino acid protein 
hOBPIIbP. From a protein structure point of view. 
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hOBPIIb0 is identical to hOBPIIba up to the 5th 
putative (3- sheet and then differs due to a reading 
frame shift. The predictions from the computer programs 
indicate that the motif ALWEALAIDTRLK is an CC-helix 
which is just behind the fifth 3-sheet. Two additional 
P-sheets may be present in the long C-terminal portion. 

None of the alternative splicing variants for one gene 
is detected symmetrically for the other gene, although 
the putative splice acceptor and donor sites may be 
present (figure 3). In addition to these protein motifs 
which conserve the conventional structure of a 
lipocalin, we have demonstrated, for the two genes 
hOBPIIa and hOBPIIb, the existence of a small amount of 
mRNA which has undergone alternative splicing and 
encodes proteins, the sequence of which is not directly 
related to the lipocalins. mRNAs encoding hOBPIIa5 and 
hOBPIIby, which lack the sequence encoding exon 2 and 
which have, respectively, exon 5b and exon 5, encode 
putative secreted proteins of 147 and 85 amino acids, 
respectively (figures 4 and 5); these proteins diverge 
from the previous proteins from the 24th amino acid. 

In order to identify the cells expressing the hOBPII 
genes, the inventors hybridized digoxigenin-labeled 
sense and antisense ribonucleic acid probes on tissue 
sections (figure 7). The hOBPII mRNAs are detected in 
the acinar cells of the middle meatus and of the nasal 
conchae, and also epithelial cells of the conchae; this 
supports the idea that the hOBPII proteins are involved 
in olfactory function. In addition to the production in 
the oral sphere, mRNAs encoding hOBPII were detected in 
the genital sphere, in particular in the glandular 
cells of the prostate, and in the secretory epithelial 
cells of the vas deferens. No signal was detected in 
the male gonads, which suggests that the expression of 
the hOBPII genes detected in the RT-PCR experiments 
corresponded to the presence of additional ducts in the 
tissue preparation (rete testis and lobuli 
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epididymidis) . In combining these results from the 
detection of all of the mRNAs encoding hOBPII with 
those of the RT-PCR approach, it appeared that five 
hOBPII proteins (hOBPIIaa, hOBPIIa0, hOBPIIay, 
hOBPIIba, hOBPIIbp) are secreted by the epithelial 
cells of the ducts of the male gonads, and also acinar 
cells of the middle meatus and of the nasal conchae; in 
these cells, the mRNAs encoding hOBPIIa are highly 
predominant. Only the two hOBPIIb proteins (hOBPIIba 
and hOBPIIbp) are secreted by the epi thelio-glandular 
cells of the prostate and of the mammary glands. 

EXAMPLE 6: PHYLOGENETIC ANALYSIS AND CLASSIFICATION OF 
THE LIPOCALINS 

The dotplot analysis (figure 2) and also the genomic 
duplications which we revealed indicate a common origin 
for the LCN1 and hOBPII genes. To clarify the relation- 
ships between the members of the lipocalin family and 
to verify that we identified the human gene orthologous 
to rat OBPII (Dear et al . , 1991), we constructed a 
phylogenetic distance tree with the vertebrate lipo- 
calins (figure 8) . 

The inventors demonstrated nine main groups of lipo- 
calins derived from a common precursor: 

the apolipoprotein and retinol binding protein (RBP) 
family (group 1) ; 

the prostaglandin D synthase and neutrophil 
gelatinase-associated lipocalin precursor group 
( group 2 ) ; 

the alpha-l-microglobulin/bikunin (protein HC) sub- 
family (group 3); 

the orosomucoid subfamily (A1AG, AlAH, AlAI ) (group 
4) ; 

the oral sphere subfamily 1 (OBPII-type-LCNl/VEGP, 
VNSP I and II, LALP, CanFl) (group 5); 
the lactoglobulin subfamily (group 6); 
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the subfamily of proteins secreted by the lizard 
epididymis (group 7 ) ; 

the oral sphere subfamily 2 {mouse major urinary 
proteins (MUP) ) (group 8) ; 
5 - the oral sphere subfamily 3 (0BP1, mouse OBPII, 
aphrodisin, probasin, BD20) (group 9). 

The present invention relates more particularly to 
groups 5 , 8 and 9 . 

10 

Group 5 contains the hOBPII proteins which are closely 
related from an evolutionary point of view to rat 
OBPII. Taken together, the present results indicate 
that the human hOBPII genes are orthologs of the rat 
15 OBPII gene. This group also contains the LCNl-VEGP 
proteins of various species. Taking into consideration 
the genomic organization of the hOBPII-LCNl genes, the 
data from the tree illustrate the duplication event 
(arrow) which gives rise to the ancestral hOBPII and 

2 0 LCNl genes from their common lipocalin precursor 

(figure 9) . More recently, the original duplications of 
the 5 0 kb region containing hOBPII-LCNl (arrow) 
generated, in humans, the two hOBPII genes and the LCNl 
gene and its pseudogene LCNlb. The additional duplica- 
25 tion which gave rise to the LCNlc pseudogene is partial 
in the human genome and does not produce a functional 
protein, and is therefore absent from the present tree. 
In addition, the two rat VEG proteins are more closely 
related to one another in the phylogenetic tree than to 

3 0 the human LCNl protein. The situation is identical for 

the two human OBPII proteins compared to the rat OBPII 
protein. These results are in favor of a process of 
gene conversion for at least some lipocalin genes. This 
may also be correlated to the fact that these genes are 
3 5 located on the same chromosomal arm. 



Group 9 corresponds to the oral sphere family 3 and 
contains OBPs which have already been described. The 
aphrodisin protein has already been described as a 
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pheromone transporter (Henzel et al . , 1988) and appears 
to be orthologous to the rat and mouse OBPl protein, 
with two paralogous OBPl genes. Finally, the olfactory 
lipocalins produced by the Bowman's glands of rana 
5 pipiens (OLFA RANPI) , which are considered to be 
potential odorant transporters in frog mucus, are not 
related to any putative OBP group (groups 5, 8 and 9), 
thus suggesting the existence of other OBP categories. 

10 Some. lipocalins have been described as being allergens. 
The major allergen of dogs, the CanFl protein, mainly 
expressed in the von Ebner ' s glands (Konieczny et al . , 
1997), appears in the tree to be the dog VEG protein. 
This result, which shows that LCN1 may be allergenic, 

15 prompts the inventors to propose that the OBPII 
proteins are involved in allergic processes. Similarly, 
the major allergen of horses, EquCl, expressed in the 
liver and the salivary glands (Gregoire et al . , 1996), 
appears to be orthologous to an MUP protein member 

2 0 (group 3) . The major allergen of cows (BD2 0 BOSTA) , 
which is present in the oral sphere family 2 (group 4) 
is more closely related to probasin. CanF2 , which is 
mainly expressed in the parotid gland (Konieczny et 
al . , 1997), does not appear to be orthologous to a 

2 5 previously described lipocalin. 

The present invention has made it possible to reveal 
the existence of a genomic duplication at the q34 locus 
of human chromosome 9, which harbors a gene family of 

30 the LCN1 type; this family comprises, besides the LCN1 
gene, which has been previously described, two pseudo- 
genes and also two hOBPII genes which are paralogous to 
LCN1 . The inventors have revealed that the hOBPII-LCNl 
family results from consecutive genomic duplication 

35 events. The sequences and also the genomic organization 
revealed that the LCN1 and hOBPII genes derive from a 
common ancestor and were generated by means of tandem 
duplication. The sequence comparisons demonstrated that 
the hOBPIIaa and hOBPIIba proteins are the hOBPII 



protein forms closest to LCNl ; this is corroborated by 
the fact that the size of exon 5 of LCNl is 121 bp, 
which corresponds to the size of exon 5a of the hOBPII 
protein, and by the fact that no exon 3b was found in 
the LCNl protein. Similar results are obtained when 
this comparison is carried out with the other seven 
exons of the lipocalins. These data are in favor of the 
diversity of the hOBPII proteins being acquired through 
the integration of surrounding additional genomic DNA 
at the upstream splice acceptor site for exon 5 of 
hOBPIIa or through the recruitment of a supernumerary 
exon (exon 3b) for the hOBPIIb protein. The hypothesis 
of a recruitment of exons rather than a decrease in the 
size of the exons or than a loss of exons is more 
probable . 

The inventors showed that the hOBPII and LCNl genes 
encode proteins involved in various functions, as shown 
by the expression of these genes in both the oral 
sphere and the genital sphere. In addition, the 
inventors showed, by the phylogenetic analysis 
(example 6) , that several different proteins could 
contribute to the same odorant-binding protein 
function; thus, three lipocalin subfamilies (groups 5, 
8 and 9) corresponding to several proteins are found 
expressed in the oral sphere, and in particular in the 
nasal and buccal glands . 

This analysis, showing that several lipocalins 
contribute to the same physiological function and that 
the same protein could contribute to various functions, 
led the inventors to analyze the expression of the 
other human lipocalins in all of the tissues studied 
(figure 10) . This led to the demonstration that the 
gene encoding apolipoprotein D is expressed in the 
glands of the nasal concha and of the middle meatus, 
thus making it a potential odorant-binding protein. 
Similarly, retinol binding protein (RBP) is expressed 
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by the nasal epithelium and may also contribute to this 
function . 

The inventors therefore propose the inclusion of the 
5 ApoD and RBP proteins in the human OBP family and 
include them in all the claims relating to these human 
OBPs . 

The inventors have demonstrated that the transcription 
10 of the two hOBPII genes generates many alternative 
transcripts which encode different proteins, the 
structure of which is compatible with that of a hydro- 
phobic ligand transporter. 

The inventors have demonstrated the expression of the 
two hOBPII genes in the oral sphere (nasal glands, von 
Ebner's glands, submaxillary glands, lacrymal glands, 
lung) . The inventors have also demonstrated that the 
hOBPII proteins according to the invention are produced 
by the cells of the genital sphere; the hOBPII gene is 
mainly expressed in the prostate, the vas deferens and 
the mammary glands, while the expression of the hOBPIIa 
gene is restricted to the vas deferens. The in situ 
analyses have revealed that the mRNA is produced by the 
glandular cells of the prostate and epithelial 
secretory cells of the vas deferens, supporting the 
idea that the corresponding proteins are secreted into 
the seminal fluid. The proteins according to the 
invention may therefore be involved in the reproductive 
function, but also in all the other functions usually 
attributed to lipocalins . 

EXAMPLE 7: PRODUCTION OF A RECOMBINANT HOBPIIBOC PROTEIN 
IN A PROKARYOTIC SYSTEM 

35 

A PCR on the plasmid DNA of an hOBPIIba clone using the 
primers Blla/b (5' GTC GGA TCC CTG TCC TTC ACC CTG GAG 
G 3'), a sense oligonucleotide beginning 45 bases after 
the protein-initiating ATG, and Xllb (5' GTC CTC GAG 



20 



25 
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GTG TTC GGG AAC GCA GCT TC 3 ' ) , an antisense oligo- 
nucleotide preceding the stop codon of the hOBPIIbCt 
protein, made it possible to amplify all of the DNA 
encoding the secreted hOBPIIba protein. The BamH I and 
5 Xho I enzymatic restriction sites located at the ends 
of the two oligonucleotides (bases underlined) were 
used for directional cloning into a plasmid expression 
vector pGEX-6Pl, followed by transformation by electro- 
poration (1800 V, 200 Q. , 25 |IF) into a bacterial strain 

10 BL21. Synthesis of the recombinant protein is obtained 
by adding IPTG, at 5 rriM final for 3 h, to 250 ml of 
culture of the strain in LB medium containing 100 fig/ml 
of ampicillin pre-incubated at 37 °C for 2 h. The 
centrifuged cultures are taken up in 25 ml of TENGN 

15 buffer. The lysate is sonicated and recentrif uged . The 
fusion protein is then purified using 4 ml of beads 
bearing covalently attached insoluble glutathione 
(Sigma) per 25 ml of supernatant. After incubation for 
4 h, they are washed with 3 volumes of 1M NaCl and then 

20 with 10 volumes of 1 X PBS. The elution is obtained by 
bringing the beads into contact with a glutathione 
solution (10 mM reduced glutathione, 50 inM Tris-HCl, pH 
8.0) for 10 min. 

2 5 The amount of recombinant protein produced is estimated 
by migration on polyacrylamide gel and staining with 
Coomassie blue (figure 11). The specificity of induction 
of a recombinant protein is tested by Western blot 
using a goat anti-GST antibody (figure 11) , revealed 

30 with a peroxidase-coupled goat anti-IgG antibody, using 
an ECL+plus kit (Amersham) . The separation of the 
fusion protein into two proteins is obtained by 
proteolysis of 100 ^g of pre-dialyzed recombinant 
protein with 2U of preScission™ protease (Pharmacia 

35 Biotech) in 10 pil final containing 1 X enzyme buffer, 
for 4 h at 5°C. 
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EXAMPLE 8: DETECTION OF THE hOBPIIs IN VARIOUS TISSUES 
BY IMMUNOHISTOCHEMISTRY 



The location of the hOBPn proteins in the nasal 
5 structures (figures 12 and 15), the buccal structures 
and the lacrymal glands (figures 13 and 18), the 
mammary glands and the lungs (figures 14 and 18) is 
revealed by immunohistochemistry . 

10 EXAMPLE 9: DETECTION OF THE hOBPIIs, LCNl and ApoD IN 
THE NASAL STRUCTURES BY RT-PCR AND BY 
IMMUNOHISTOCHEMISTRY 



The RT-PCRs carried out using the sets of primers for 
15 the hOBPIIs and for LCNl reveal the presence of the 
corresponding mRNAs in at least one of the nasal 
structures. The techniques of immunohistochemical 
detection both by fluorescence detection and by 
revelation using peroxidase and DAB reveal the presence 

2 0 of the hOBPII (figure 15) and LCNl (figure 16) proteins 

in the epithelial cells of the acini of the nasal 
glands, and also in the lumina of the secretory ducts. 
Some can also be detected on mucus attached to the 
tissues . 

25 

The RT-PCRs carried out using the sets of primers for 
apolipoprotein D reveal the presence of the corres- 
ponding mRNAs in at least one of the nasal structures. 
The techniques of immunohistochemical detection by 

3 0 revelation using peroxidase and DAB reveal the presence 

of the ApoD protein (figure 17) in the epithelial cells 
of the acini of the nasal glands, and also in the 
lumina of the secretory ducts, although the expression 
appears to be weaker and in fewer acini than for the 
35 hOBPIIs. 
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EXAMPLE 10: DETECTION OP THE hOBPIIs, LCNl and ApoD IN 
THE PLACENTA BY RT-PCR AND BY IMMUNOHISTO- 
CHEMI S TRY 



5 The RT-PCRs carried out using the sets of primers for 
the hOBPIIs and for LCN1 reveal the presence of the 
corresponding mRNAs in the placenta . The techniques of 
immunohistochemical detection by revelation using 
peroxidase and DAB reveal the presence of the hOBPII 

10 (figure 19) and LCNl (figure 20) proteins very strongly 
in the lacunae of the placental villi and much more 
weakly, and very diffusely, in the epithelial cells of 
the amniotic membrane and in subjacent cells, both for 
the amniotic membrane and for the cord. The expression 

15 of the mRNAs detected by RT-PCR in the placental villi 
indicates that these hOBPII proteins are produced by 
the placenta in order to transfer lipophilic molecules 
(hormones, fatty acids, vitamins, etc.) to the fetus 
via the fetal blood circulation. The proteins detected 

2 0 in the amniotic membrane may correspond to the 
absorption of these proteins from the amniotic fluid, 
since it is known that the nasal structures of the 
fetus are bathed in this amniotic fluid. The hOBPII and 
LCNl proteins would thus contribute to the transport of 

25 lipophilic substances which are small in size from the 
mother to the fetus in the chorionic villi via the 
fetal blood, and from the fetus to the mother via the 
amniotic fluid. 



3 0 The RT-PCRs carried out using the sets of primers for 
the ApoD protein reveal the presence of the 
corresponding mRNAs in the placenta . The techniques of 
immunohistochemical detection by revelation using 
peroxidase and DAB reveal the presence of the ApoD 

35 protein very strongly in the lacunae of the placental 
villi (figure 21). The ApoD protein is not detected in 
the cells of the amniotic membrane nor in subjacent 
cells, both for the amniotic membrane and for the 
cordon. The expression of the mRNAs detected by RT-PCR 
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in the placenta indicates that the ApoD protein is 
produced by the placenta in order to transfer lipophilic 
molecules (hormones, fatty acids, vitamins, etc.) to 
the fetus via the fetal blood circulation. The ApoD 
5 protein may also be produced by the fetal liver. The 
ApoD protein would thus contribute to the transport of 
lipophilic substances which are small in size from the 
mother to the fetus in the chorionic villi via the 
fetal blood. 

10 

More generally, the expression of lipocalins by the 
placenta, such as alpha-l-acid glycoprotein (A1AG) , 
prostaglandin D synthase (PGDS) , PP14 and RBP, as 
revealed by the RT-PCR techniques, indicates that these 

15 proteins, which have hydrophobic molecule- transporting 
properties, are recruited in massive amounts for 
mother-child connections, for transporting these 
molecules which may be steroid hormones, fatty acids, 
vitamins or xenobiotics, and more generally metabolism- 

20 derived detoxification products. 
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CLAIMS 

1. An isolated polypeptide comprising an amino acid 
sequence having at least 90% identity with the amino 
acid sequences SEQ ID No. 2, SEQ ID No. 4, SEQ ID 
No. 6, SEQ ID No. 8, SEQ ID No. 12 or SEQ ID No. 14. 

2. An isolated polypeptide, characterized in that it 
comprises a polypeptide chosen from: 

a) a polypeptide of sequence SEQ ID No . 2, SEQ ID No . 4, 
SEQ ID No. 6, SEQ ID No . 8, SEQ ID No. 12 or SEQ ID 
No. 14; 

b) a polypeptide which is a variant of a polypeptide of 
amino acid sequences defined in a) ; 

c) a polypeptide homologous to the polypeptide defined 
in a) or b) and comprising at least 90% identity 
with said polypeptide of a) ; 

d) a fragment of at least 15 consecutive amino acids of 
a polypeptide defined in a) , b) or c) ; 

e) a biologically active fragment of a polypeptide 
defined in a) , b) or c ) . 

3. An isolated polypeptide selected from a poly- 
peptide corresponding to the sequence SEQ ID No . 2 and 
named OBPII aa , to the sequence SEQ ID No. 4 and named 
OBPII a p, to the sequence SEQ ID No . 6 and named OBPII ay , 
and to the sequence SEQ ID No. 12 and named 0BPII b p. 

4. The polypeptide as claimed in any one of claims 1 
to 3, characterized in that it comprises at least the 
Gly-Thr-Trp-Tyr domain. 
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5. An isolated polynucleotide, characterized in that 
it encodes a polypeptide as claimed in any one of 
claims 1 to 4 . 



5 6. The polynucleotide as claimed in claim 5, of 
sequence SEQ ID No . 1, SEQ ID No . 3, SEQ ID No . 5, SEQ 
ID No. 7, SEQ ID No. 11 or SEQ ID No. 13. 

7. An isolated polynucleotide, characterized in that 
10 it comprises a polynucleotide chosen from: 



a) a polynucleotide of sequence SEQ ID No. 1, SEQ ID 

No. 3, SEQ ID No. 5, SEQ ID No . 7, SEQ ID No. 11 or 

SEQ ID No. 13, or the sequence of which is that of 

15 the RNA corresponding to the sequence SEQ ID No. 1, 

SEQ ID No. 3, SEQ ID No. 5, SEQ ID No. 7, SEQ ID 
No. 11 or SEQ ID No. 13; 



b) a polynucleotide, the sequence of which is comple- 
2 0 mentary to the sequence of a polynucleotide defined 

in a) , 



c) a polynucleotide, the sequence of which comprises at 
least 80% identity with a polynucleotide defined in 
25 a) or b) , 



d) a fragment of at least 15 consecutive nucleotides, 
in preference 21 consecutive nucleotides, and 
preferably 3 0 consecutive nucleotides, of a poly- 
30 nucleotide defined in a) , b) , c) or d) . 



8. The polynucleotide as claimed in claim 7, charac- 
terized in that it is labeled directly or indirectly 
with a radioactive compound or a nonradioactive 

3 5 compound . 

9. The use of a polynucleotide as claimed in claim 8, 
as a probe for detecting nucleic acid sequences. 



10. The use of a polynucleotide as claimed in any one 
of claims 5 to 8, as a primer for amplifying or 
polymerizing nucleic acid sequences. 

11. The use of a polynucleotide as claimed in claim 8, 
as a sense or anti sense oligonucleotide for controlling 
the expression of the corresponding protein product. 

12 . A recombinant vector for cloning a polynucleotide 
as claimed in one of claims 5 to 8 and/or for 
expressing a polypeptide as claimed in one of claims 1 
to 4, characterized in that it contains a poly- 
nucleotide as claimed in any one of claims 5 to 8. 

13. The vector as claimed in claim 12, characterized 
in that it comprises the elements which allow the 
expression, optionally the secretion, of said poly- 
peptide in a host cell. 

14 . The vector as claimed in any one of claims 12 to 
13, characterized in that the elements which allow the 
expression of said polypeptide are chosen from: 

a) the isolated polynucleotide of sequence SEQ ID 
No. 15 and SEQ ID No. 16; 

b) a polynucleotide, the sequence of which is comple- 
mentary to the sequence of the polynucleotide 
defined in a) ; 

c) a polynucleotide, the sequence of which comprises at 
least 80% identity with a polynucleotide defined in 
a) or in b) ; 

d) a polynucleotide which hybridizes, under conditions 
of high stringency, with a sequence of the poly- 
nucleotide defined in a) , b) or c) . 
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15. The vector as claimed in claims 13 and 14, for 
expression in eukaryotic cells, selected from viral DNA 
and naked DNA. 

16. A host cell, characterized in that it is trans- 
formed with a vector as claimed in one of claims 12 to 
15. 

17. A process for preparing a recombinant polypeptide, 
characterized in that a host cell as claimed in 
claim 16 is cultured under conditions which allow the 
expression and, optionally, the secretion of said 
recombinant polypeptide, and in that said recombinant 
polypeptide is recovered. 

18 . A recombinant polypeptide which can be obtained 
using a process as claimed in claim 17. 

19 . The use of a polypeptide chosen from a polypeptide 
as claimed in any one of claims 1 to 4 and 18, or a 
fragment thereof, as a protein which binds to a hydro- 
phobic ligand, preferably an odorous molecule. 

20. The use of a polypeptide as claimed in any one of 
claims 1 to 4 and 18, or of the polypeptide hOBPII ba of 
sequence SEQ ID No. 10, a polypeptide which is a 
variant of said polypeptide hOBPIIba, a polypeptide 
homologous to said polypeptide hOBPII ba comprising at 
least 90% identity with said polypeptide hOBPII ba or a 
fragment thereof, as a competitive inhibitor, as an 
agonist or as an antagonist of the cellular receptors 
for lipocalins. 

21. A monoclonal or polyclonal antibody, and fragments 
thereof, characterized in that it is specifically 
against an isolated polypeptide as claimed in one of 
claims 1 to 4 and 18 . 
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22. The use of an antibody as claimed in claim 21, for 
demonstrating the presence of a polypeptide as claimed 
in one of claims 1 to 4 and 18 in a biological sample. 

23 . A process for detecting an antibody against hOBPII 
in human serum from an allergic and/or asthmatic 
patient, using an hOBPII polypeptide. 

24. A process for controlling the volatilization of an 
odorant, characterized in that it comprises a step of 
binding of said odorant with a polypeptide as claimed 
in any one of claims 1 to 4 and 18, or with the poly- 
peptide hOBPII ba of sequence SEQ ID No. 10, a poly- 
peptide which is a variant of said polypeptide hOBPII ba 
or a polypeptide homologous to said polypeptide hOBPII bot 
comprising at least 90% identity with said polypeptide 
hOBPII ba . 

25. The process as claimed in claim 24, characterized 
in that the polypeptide is bound to a solid support. 

26. The process as claimed in claim 24, characterized 
in that the polypeptide is in a liquid composition. 

27. The process as claimed in claim 26, characterized 
in that said composition is a fragranced composition 
for the skin. 

28. A process for screening a molecule, preferably 
odorants or flavors, which comprises passing the 
molecule over a substrate which comprises a polypeptide 
as claimed in any one of claims 1 to 4 and 18, or the 
polypeptide hOBPIIba of sequence SEQ ID No . 10, a poly- 
peptide which is a variant of said polypeptide hOBPII ba 
or a polypeptide homologous to said polypeptide hOBPII ba 
comprising at least 90% identity with said polypeptide 
hOBPIIba, bound to said substrate, said polypeptide 
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binding said odorant or flavor, and recovering said 
odorant or flavor from the polypeptide if necessary. 

29. The process as claimed in claim 28, characterized 
in that the molecules are human pheromones . 

30. A process for solubilizing lipophilic molecules, 
characterized in that it comprises binding said 
molecule to a polypeptide as claimed in any one of 
claims 1 to 4 and 18, or to the polypeptide hOBPII ba of 
sequence SEQ ID No. 10, a polypeptide which is a 
variant of said polypeptide hOBPII bct or a polypeptide 
homologous to said polypeptide hOBPIIba comprising at 
least 90% identity with said polypeptide hOBPII ba . 

31. The application of the polypeptides as claimed in 
one of claims 1 to 4 and 18, or of the polypeptide 
hOBPII ba of sequence SEQ ID No. 10, of a polypeptide 
which is a variant of said polypeptide hOBPII ba or of a 
polypeptide homologous to said polypeptide hOBPII b a 
comprising at least 90% identity with said polypeptide 
hOBPIIba, in combination with dietary fatty acids, as a 
food supplement . 

32. The application as claimed in claim 31, for 
preparing a medicinal product intended for the 
treatment of hyper lipidemi as and obesity. 

33. The application as claimed in claim 31, for 
supplementing nonmaternal milks. 

34. The polypeptide as claimed in any one of claims 1 
to 4 and 18, or the polypeptide hOBPII ba of sequence SEQ 
ID No. 10, a polypeptide which is variant of said poly- 
peptide hOBPIIba or a polypeptide homologous to said 
polypeptide hOBPII ba comprising at least 90% identity 
with said polypeptide hOBPII ba , as an agent for 
targeting a pharmaceutical compound. 



- 73 - 



35. The polypeptide as claimed in claim 34, charac- 
terized in that said polypeptide is expressed in the 
form of a protein from fusion with a protein which 

5 allows specific cellular addressing. 

36. The polypeptide as claimed in claim 35, charac- 
terized in that said protein which allows specific 
cellular addressing is chosen from the group composed 

10 of interleukins , of cytokines, of lymphokines, of 
interferons, of growth factors, of hormones and of 
antibodies. 

37. The polypeptide as claimed in claim 34, charac- 
15 terized in that said polypeptide is combined with a 

molecule which allows specific cellular addressing. 

38. The polypeptide as claimed in claim 37, charac- 
terized in that said molecule which allows specific 

2 0 cellular addressing is chosen from the group composed 
of steroids, of interleukins, of cytokines, of lympho- 
kines, of interferons, of growth factors, of hormones 
and of antibodies. 

25 39. The polypeptide as claimed in any one of claims 1 
to 4 and 18, or the polypeptide hOBPII ba of sequence SEQ 
ID No. 10, a polypeptide which is a variant of said 
polypeptide hOBPII ba or a polypeptide homologous to said 
polypeptide hOBPII b a comprising at least 90% identity 

30 with said polypeptide hOBPIIba, as a pharmaceutical 
compound transporter . 

40. A pharmaceutical composition comprising a pharma- 
ceutical compound bound at least to a polypeptide as 
35 claimed in one of claims 34 to 39, and a pharma- 
ceutically acceptable vehicle. 
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41. The pharmaceutical composition as claimed in 
claim 40, characterized in that the pharmaceutical 
compound is chosen from the group of anticancer agents. 

5 42. The pharmaceutical composition as claimed in 
claim 41, characterized in that said anticancer agent 
is a radioactive isotope chosen from the group: 
iodine 131 , yttrium 90 , gold 199 , palladium 100 , copper 67 , 
bismuth 217 and antimony 211 . 

10 

43. The pharmaceutical composition as claimed in any 
one of claims 40 to 42, characterized in that said 
polypeptide as claimed in any one of claims 34 to 39 
constitutes a delayed form of delivery of said 

15 pharmaceutical compound in the body. 

44. A pharmaceutical composition comprising an 
expression vector as claimed in claim 12 or 13, and a 
pharmaceutically acceptable vehicle. 

20 

45. The pharmaceutical composition as claimed in any 
one of claims 40 to 44, for treating cancer preferably 
chosen from breast cancer, uterine cancer, prostate 
cancer, liver cancer and pulmonary epithelial cell 

25 carcinoma. 

46. The use of a polypeptide chosen from the poly- 
peptide hOBPIIba of sequence SEQ ID No. 10, a poly- 
peptide which is a variant of said polypeptide hOBPII ba 

30 and a polypeptide homologous to said polypeptide 
hOBPII ba comprising at least 90% identity with said 
polypeptide hOBPII b a, for preparing a medicinal product 
intended for the treatment of uterine cancer, prostate 
cancer, liver cancer and pulmonary epithelial cell 

3 5 cancer. 



47. The use of the polypeptide as claimed in any one 
of claims 1 to 4 and 18, or the polypeptide hOBPII ba of 



sequence SEQ ID No. 10, or a polypeptide which is a 
variant of said polypeptide hOBPII ba or a polypeptide 
homologous to said polypeptide hOBPIIba comprising at 
least 90% identity with said polypeptide hOBPII ba , for 
preparing a medicinal product intended for the 
transport of a compound across the placental barrier. 

48. The use as claimed in claim 47, characterized in 
that said transport is carried out from the gestating 
mother to the fetus, and in that said compound is 
chosen from hormones, essential fatty acids, lipophilic 
medicinal products and vitamins. 

49. The use as claimed in claim 47, characterized in 
that said transport is carried out from the fetus to 
the mother and is intended for the detoxification of 
the fetus. 

50. The polypeptide as claimed in claims 1 to 4 and 
18, or the polypeptide hOBPII bot of sequence SEQ ID 
No. 10, a polypeptide which is a variant of said poly- 
peptide hOBPIIba or a polypeptide homologous to said 
polypeptide hOBPII ba comprising at least 90% identity 
with said polypeptide hOBPII ba , as a pregnancy marker. 

51. The polypeptide as claimed in claims 1 to 4 and 
18, or the polypeptide hOBPII ba of sequence SEQ ID 
No. 10, a polypeptide which is a variant of said poly- 
peptide hOBPIIba or a polypeptide homologous to said 
polypeptide hOBPII b a comprising at least 90% identity 
with said polypeptide hOBPII ba , as a marker for a feto- 
placental pathological condition. 
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Atty. Dkt. No. 065691-0260 



DECLARATION AND POWER OF ATTORNEY 

As a below named inventor, I HEREBY DECLARE: 

THAT my residence, post office address, and citizenship are as stated below next to my 

name; 

THAT I believe I am the original, first, and sole inventor (if only one inventor is named 
below) or an original, first, and joint inventor (if plural inventors are named below or in an 
attached Declaration) of the subject matter which is claimed and for which a patent is sought 
on the invention entitled ODORANT— BINDING HUMAN PROTEINS HYDROPHOBIC , LIGANDS : POLYPEPTIDES 
AND POLYNUCLEOTIDES CODING FOR SAID POLYPEPTIDES AND OSES THEREOF 



(Attorney Docket No. 065691-0260) 

the specification of which (check one) 

is attached hereto. 

XX was filed on Au g. 11. 2000 as:^i«&KgXaaSgXSi^^ 

PCT International Application Number prrr/yRno/nyU Q and was 
amended on (if applicable). 

THAT I do not know and do not believe that the same invention was ever known or 
used by others in the United States of America, or was patented or described in any printed 
publication in any country, before I (we) invented it; 

THAT I do not know and do not believe that the same invention was patented or 
described in any printed publication in any country, or in public use or on sale in the United 
States of America, for more than one year prior to the filing date of this United States 
application; 

THAT I do not know and do not believe that the same invention was first patented or 
made the subject of an inventor's certificate that issued in any country foreign to the United 
States of America before the filing date of this United States application if the foreign 
application was filed by me (us), or by my (our) legal representatives or assigns, more than 
twelve months (six months for design patents) prior to the filing date of this United States 
application; 

THAT I have reviewed and understand the contents of the above-identified specification, 
including the claim(s), as amended by any amendment specifically referred to above; 

THAT I believe that the above-identified specification contains a written description of 
the invention, and of the manner and process of making and using it, in such full, clear, concise, 
and exact terms as to enable any person skilled in the art to which it pertains, or with which it 
is most nearly connected, to make and use the invention, and sets forth the best mode 
contemplated by me of carrying out the invention; and 

THAT I acknowledge the duty to disclose to the U.S. Patent and Trademark Office all 
information known to me to be material to patentability as defined in Title 37, Code of Federal 
Regulations, §1.56. 
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I HEREBY CLAIM foreign priority benefits under Title 35, United States Code §119(a)-(d) 
or § 365(b) of any foreign application(s) for patent or inventor's certificate, or §365(a) of any 
PCT international application which designated at least one country other than the United States 
of. America, listed below and have also identified below any foreign application for patent or 
inventor's certificate or of any PCT international application having a filing date before that of 
the application on which priority is claimed. 



Prior Foreign 
Application Number 


Country 


Foreign Filing Date 


Priority 
Claimed? 


Certified 
Copy 
Attached? 


99 10439 


France 


Aug. 12, 1999 


yes 

























I HEREBY CLAIM the benefit under Title 35, United States Code § 1 19(e) of any United 
States provisional application(s) listed below. 



U.S. Provisional Application Number 


Filing Date 















I HEREBY CLAIM the benefit under Title 35, United States Code, §1 20 of any United 
States application(s), or § 365(c) of any PCT international application designating the United 
States of America, listed below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in the prior United States or PCT International application in the 
manner provided by the first paragraph of Title 35, United States Code, § 1 1 2, I acknowledge 
the duty to disclose information which is material to patentability as defined in Title 37, Code of 
Federal Regulations, § 1 .56 which became available between the filing date of the prior 
application and the national or PCT international filing date of this application. 



U.S. Parent 
Application Number 


PCT Parent 
Application Number 


Filing Date 


Parent 
Patent Number j 




PCT/FR00/02319 


August 11, 2000 






















to have full power to prosecute this application and any continuations, divisions, reissues, and 
reexaminations thereof, to receive the patent, and to transact all business in the United States 
Patent and Trademark Office connected therewith. 

5 
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I request that all correspondence be directed to: 



Stephen B. Maebius 
EEY& LARDNER 
Cust omeT^um ber: 22428 




PATENT TRADEMAXKOFFICE 



Telephone: 
Facsimile: 



(202) 672-5569 
(202) 672-5399 



I UNDERSTAND AND AGREE THAT the foregoing attorneys and agents appointed by me 
to prosecute this application do not personally represent me or my legal interests, but instead 
represent the. interests of the legal owner(s) of the invention described in this application. 

I FURTHER DECLARE THAT all statements made herein of my own knowledge are true, 
and that all statements made on information and belief are believed to be true; and further that 
these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code, and that such willful false statements may jeopardize the validity of the 
application or any patent issuing thereon. 



Name of first inventor 

Residence 

Citizenship 

Post Office Address 

Inventor's signature 

Date 



PITIOT Gllles 



151 rue du Chevaleret. 75013 Paris / Franci 
Frenrh 



The same a; 



Name of second inventoi 
Residence 
Citizenship 
Post Office Address 
Inventor's signature 
Date 




ACAZETTE Eric 



Appai-t:piimnt: 221- 66 boul evard La Fayette. 63000 C lenncm4i 

Ker rand / France 1^1 
French « ■-■ - — — "7 / 

The same as residence 
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Name of third inventor 

Residence 

Citizenship 

Post Office Address 

Inventor's signature 

Date 

Name of fourth inventor 

Residence 

Citizenship 

Post Office Address 

Inventor's signature 

Date 
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GACHOW F ranco! Re , 



11 nip Hps Pai Harris . 6T>40 R om ag"^ L Fra n, 
French . 



The same as residence 



June 13, 2002 
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SMALL ENTITY DECLARATION 

[37 CFR 1.9(c-f)J 



Each undersigned declares that: 

(1) □ the application attached hereto 

(2) □ U.S. Application Serial No filed 

(3) □ U.S. Patent No Issued 

is entitled to the benefits of "small entity" status for paying reduced fees under 35 USC 41(a) and (b) to 
the Patent and Trademark Office by virtue of the following: 

(4) [2 Each undersigned declares that hefehe qualifies as an independent inventor, or would qualify 
had he/she made the invention, as defined in 37 CFR 15(c). 

(5) □ The undersigned declares that he/she is an official empowered to act on behalf of the concern 
identified below; that this concern qualifies as a small business concern as defined in 37 CFR 1.9(d). 

(6) Q The undersigned declares the he/she is an official empowered to act on behalf of the 
organization identified below; that this organization qualifies as a nonprofit organization as defined in 

(a) □ 37 CFR 1.9(e)(1) 

(b) □ 37 CFR 1.9(e)(2) 

(c) □ 37 CFR 1.9(e)(3) 

(d) □ 37 CFR 1.9(e)(4) State law of __Eranee 

(7) Each person, concern or organization to which l/we have assigned, granted, conveyed or licensed, 
or am under an obligation under contract or law to assign, grant, convey, or license any rights in the invention 
is listed below: 

(a) □ no such person, concern or organization 

(b) □ persons, concerns or organizations listed below 

[a separate declaration is required from each named person, concern or organization having rights 
to this invention averring to their status as "small entities."! 

Full Name PITIOT Gilles 

Address 151 rue du Chevaleret, 75013 Paris / France 

S Individual □ Small Business Concern □ Nonprofit Organization 

l/we acknowledge the duty to file, in this application or patent, notification of any change in status 
resulting in loss of entitlement of small entity status prior to paying, or at the time of paying, the earliest 
of the issue fee or any maintenance fee due after the date on which status as a small entity is no longer 
appropriate. (37 CFR 1.28(b)) 

l/we hereby declare all statements made herein of his/her own knowledge are true and that all statements 
made on information and belief are believed to be true; and further that these statements were made with 
the knowledge that willful false statements so made are pyjaishable by fine or imprisonment, or both, under 
Section 1001 of Title 18 of the United States Codeaj^that^uch ^willful false statements may jeopardize 
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SEQUENCE LISTING 
<110> UNIVERSITE D'AUVERGNE CLERMONT I 

<120> HUMAN ODORANT— BINDING PROTEINS WHICH BIND HYDROPHOBIC LIGANDS : 

POLYPEPTIDES AND POLYNUCLEOTIDES ENCODING SAID POLYPEPTIDES, AND 
USES THEREOF 

<130> D15947 
<160> 16 

<170> Patentln Vers. 2.0 

<210> 1 

<211> 676 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43).. (552) 

<223> cDNA396 (676) /gl (hOBPI la-alpha ) 
<4 00> 1 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag 



ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 
Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 
5 10 IS 20 

gag gag gag gat ate aca ggg acc tgg tac gtg aag gec atg gtg gtc 
Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys Ala Met Val Val 
25 30 35 

gat aag gac ttt ccg gag gac agg agg ccc agg aag gtg tec cca gtg 
Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser Pro Val 
40 45 SO 

aag gtg aca gec ctg ggc ggt ggg aac ttg gaa gee acg ttc acc ttc 
Lys Val Thr Ala Leu Gly Gly Gly Asn Levi Glu Ala Thr Phe Thr Phe 
55 60 65 

atg agg gag gat egg tgc ate cag aag aaa ate ctg atg egg aag acq 
Met Arg Glu Asp Arg Cys He Gin Lys Lys He Leu Met Arg Lys Thr 
70 75 go 

gag gag cct ggc aaa ttc age gee tat ggg ggc agg aag etc ata tac 
Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg Lys Leu He Tyr 
85 90 95 100 

ctg cag gag ctg ccc ggg acg gac gac tac gtc ttt tac tgc aaa gac 
Leu Gin Glu Leu Pro Gly Thr Asp Asp Tyr Val Phe Tyr Cys Lys Asp 
105 no i 15 
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cag cgc cgt ggg ggc ctg cgc tac atg gga aag ctt gtg ggt agg aat 438 
Gin Arg Arg Gly Gly Leu Arg Tyr Met Gly Lys Leu Val Gly Arg Asn 
120 125 130 

cct aat acc aac ctg gag gcc ctg gaa gaa ttt aag aaa tug gtg cag 486 
Pro Asn Thr Asn Leu Glu Ala Leu Glu Glu Phe Lys Lys Leu Val Gin 
135 140 145 

cac aag gga etc teg gag gag gac att ttc atg ccc ctg cag acg gga 534 
His Lys Gly Leu Ser Glu Glu Asp He Phe Met Pro Leu Gin Thr Gly 
150 155 160 

age tgc gtt etc gaa cac taggcagccc ccgggtctgc acctccagag 592 
Ser Cys Val Leu Glu His 
165 170 

cccaccctac caccagacac agagecegga ccacctggac ctaccctcca gccatgaccc 642 
ttccctgctc ccacccacct gactccaaat aaag 676 

<210> 2 
<211> 170 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 
1 5 10 15 

Ser Phe Thr Leu Glu Glu Glu Asp He Thr Gly Thr Trp Tyr Val Lys 
20 25 30 

Ala Met Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys 
35 40 4 5 

Val Ser Pro Val Lys Val Thr Ala Leu Gly Gly Gly Asn Leu Glu Ala 
SO 55 60 

Thr Phe Thr Phe Met Arg Glu Asp Arg Cys He Gin Lys Lys Tie Leu 
65 70 75 80 

Met Arg Lys Thr Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg 

85 90 95 

Lys Leu He Tyr Leu Gin Glu Leu Pro Gly Thr Asp Asp Tyr Val Phe 
100 105 no 

Tyr Cys Lys Asp Gin Arg Arg Gly Gly Leu Arg Tyr Met Gly Lys Leu 
115 120 125 

Val Gly Arg Asn Pro Asn Thr Asn Leu Glu Ala Leu Glu Glu Phe Lys 
130 135 140 

Lys Leu Val Gin His Lys Gly Leu Ser Glu Glu Asp He Phe Met Pro 
145 150 155 160 

Leu Gin Thr Gly Ser Cys Val Leu Glu His 



WO 01/12806 



PCT/FR00/02319 



<210> 3 

<211> 725 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43).. (480) 

<223> CDNA396 (725) /SM12 (hOBPIIa-beta ) 
<400> 3 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag atg aag acc ctg 54 

Met Lys Thr Leu 
1 

ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 102 
Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 
5 10 15 20 

gag gag gag gat ate aca ggg acc tgg tac gtg aag gec atg gtg gtc 150 
Glu Glu Glu Asp He Thr Gly Thr Trp Tyr Val Lys Ala Met Val Val 
25 30 35 

gat aag gac ttt ccg gag gac agg agg ccc agg aag gtg tec cca gtg 198 
Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser Pro Val 
40 45 50 

aag gtg aca gec ctg ggc ggt ggg aac ttg gaa gee acg ttc acc ttc 24 6 
Lys Val Thr Ala Leu Gly Gly Gly Asn Leu Glu Ala Thr Phe Thr Phe 
55 60 65 

atg agg gag gat egg tgc ate cag aag aaa ate ctg atg egg aag acg 294 
Met Arg Glu Asp Arg Cys He Gin Lys Lys He Leu Met Arg Lys Thr 
70 75 80 

gag gag cct ggc aaa ttc age gec tat ggg ggc agg aag etc ata tac 342 
Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg Lys Leu He Tyr 
85 90 95 ioo 

ctg cag gag ctg ccc ggg acg gac gac tac gtc ttt tac tgc aaa gac 390 
Leu Gin Glu Leu Pro Gly Thr Asp Asp Tyr Val Phe Tyr Cys Lys Asp 
105 HO us 

cag cgc cgt ggg ggc ctg cgc tac atg gga aag ctt gtg ggg ccg tgc 438 
Gin Arg Arg Gly Gly Leu Arg Tyr Met Gly Lys Leu Val Gly Pro Cys 
120 125 130 

cgc tgt ccc cac gtc ggc tea cct ggc cac etc acc tgc agg 4Q0 
Arg Cys Pro His Val Gly Set Pro Gly His Leu Thr Cys Arg 
135 140 145 

taggaatcct aataccaacc tggaggcect ggaagaattt aagaaattgg tgcagcacaa 540 

gggactctcg gaggaggaca ttttcatgee cctgcagacg ggaagctgcg ttctcgaaca 600 
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ctaggcagcc cccgggtctg cacctccaga gcccacccta ccaccagaca cagagcccgg 660 
accacctgga cctaccctcc agccatgacc cttccctgct cccacccacc tgactccaaa 720 
taaag 725 

<210> 4 
<211> 146 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 
15 lO 15 

Ser Phe Thr Leu Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys 
20 25 30 

Ala Met Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys 
35 4 0. 4 5 

Val Ser Pro Val Lys Val Thr Ala Leu Gly Gly Gly Asn Leu Glu Ala 

50 55 60 

Thr Phe Thr Phe Met Arg Glu Asp Arg Cys lie Gin Lys Lys He Leu 
65 70 75 80 

Met Arg Lys Thr Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg 
85 90 95 

Lys Leu He Tyr Leu Gin Glu Leu Pro Gly Thr Asp Asp Tyr Val Phe 
100 105 110 

Tyr Cys Lys Asp Gin Arg Arg Gly Gly Leu Arg Tyr Met Gly Lys Leu 
115 120 125 

Val Gly Pro Cys Arg Cys Pro His Val Gly Ser Pro Gly His Leu Thr 



<210> 5 
<211> 741 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43) . . (726) 

<223> CDNA396 (741) / SM4 (hOBPI la-gamma) 
<400> 5 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag atg aag acc ctg 

Met Lys Thr Leu 
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ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 
Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 



gat aag gac ttt ccg gag gac agg agg ccc agg aag gtg tec cca gtg 
Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser Pro Val 
40 45 50 

aag gtg aca gec ctg ggc ggt ggg aac ttg gaa gee acg ttc acc ttc 
Lys Val Thr Ala Leu Gly Gly Gly Asn Leu Glu Ala Thr Phe Thr Phe 
55 60 65 

atg agg gag gat egg tgc ate cag aag aaa ate ctg atg egg aag acg 
Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu Met Arg Lys Thr 
70 75 80 

gag gag cct ggc aaa ttc age gee tat ggg ggc agg aag etc ata tac 
Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg Lys Leu lie Tyr 
85 90 95 100 

ctg cag gag ctg ccc ggg acg gac gac tac gtc ttt tac tgc aaa gac 
Leu Gin Glu Leu Pro Gly Thr Asp Asp Tyr Val Phe Tyr Cys Lys Asp 
105 110 115 

cag cgc cgt ggg ggc ctg cgc tac atg gga aag ctt gtg gca tct get 
Gin Arg Arg Gly Gly Leu Arg Tyr Met Gly Lys Leu Val Ala Ser Ala 
120 ' 125 * ' 130 

ccc tgc agg gec gtg ccg ctg tec cca cgt egg etc acc tgg cca cct 
Pro Cys Arg Ala Val Pro Leu Ser Pro Arg Arg Leu Thr Trp Pro Pro 
135 140 145 

cac ctg cag gta gga ate eta ata cca acc tgg agg ccc tgg aag aat 
His Leu Gin Val Gly lie Leu lie Pro Thr Trp Arg Pro Trp Lys Asn 
150 155 160 

tta aga aat tgg tgc age aca agg gac tct egg agg agg aca ttt tea 
Leu Arg Asn Trp Cys Ser Thr Arg Asp Ser Arg Arg Axg Thr Phe Ser 
165 170 175 180 

tgc ccc tgc aga egg gaa get gcg ttc teg aac act agg cag ccc ccg 
Cys Pro Cys Arg Arg Glu Ala Ala Phe Ser Asn Thr Arg Gin Pro Pro 
185 190 195 

ggt ctg cac etc cag age cca ccc tac cac cag aca cag age ccg gac 
Gly Leu His Leu Gin Ser Pro Pro Tyr His Gin Thr Gin Ser Pro Asp 
200 205 210 

cac ctg gac eta ccc tec age cat gac cct tec ctg etc cca ccc acc 
His Leu Asp Leu Pro Ser Ser His Asp Pro Ser Leu Leu Pro Pro Thr 
215 220 225 



tgactccaaa taaag 
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<210> 6 

<211> 228 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 
15 10 is 

Ser Phe Thr Leu Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys 

20 25 30 

Ala Met Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys 
35 4 0 45 

Val Ser Pro Val Lys Val Thr Ala Leu Gly Gly Gly Asn Leu Glu Ala 
50 55 60 

Thr Phe Thr Phe Met Arg Glu Asp Arg Cys lie Gin Lys Lys lie Leu 
65 70 75 80 

Met Arg Lys Thr Glu Glu Pro Gly Lys Phe Ser Ala Tyr Gly Gly Arg 
85 90 95 

Lys Leu lie Tyr Leu Gin Glu Leu Pro Gly Thr Asp Asp Tyr Val Phe 
100 105 no 

Tyr Cys Lys Asp Gin Arg Arg Gly Gly Leu Arg Tyr Met Gly Lys Leu 
115 120 125 

Val Ala Ser Ala Pro Cys Arg Ala Val Pro Leu Ser Pro Arg Arg Leu 
130 135 140 

Thr Trp Pro Pro His Leu Gin Val Gly lie Leu He Pro Thr Trp Arg 
145 150 155 160 

Pro Trp Lys Asn Leu Arg Asn Trp Cys Ser Thr Arg Asp Ser Arg Arg 
165 170 175 

Arg Thr Phe Ser Cys Pro Cys Arg Arg Glu Ala Ala Phe Ser Asn Thr 
180 185 190 

Arg Gin Pro Pro Gly Leu His Leu Gin Ser Pro Pro Tyr His Gin Thr 
195 200 205 

Gin Ser Pro Asp His Leu Asp Leu Pro Ser Ser His Asp Pro Ser Leu 
210 215 220 



<210> 7 

<211> 607 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> CDS 

<222> (43) . . (483) 

<223> cDNA396 (607) - short form (hOBPIIa-delta ) 



<400> 7 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag atg aag acc ctg 54 

Met Lys Thr Leu 



ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 102 
Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 
5 10 15 20 



gag gag gag gat gag gga gaa teg gtg cat cca gaa gaa aat cct gat 150 
Glu Glu Glu Asp Glu Gly Glu Ser Val His Pro Glu Glu Asn Pro Asp 
25 30 35 



gcg gaa gac gga gga gec tgg caa att cag cgc eta tgg ggg cag gaa 198 
Ala Glu Asp Gly Gly Ala Trp Gin lie Gin Arg Leu Trp Gly Gin Glu 
40 45 50 



get cat ata cct gca gga get gec egg gac gga cga eta cgt ctt tta 24 6 

Ala His lie Pro Ala Gly Ala Ala Arg Asp Gly Arg Leu Arg Leu Leu 
55 60 65 

ctg caa aga cca gcg ccg tgg ggg cct gcg eta cat ggg aaa get tgt 294 

Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Leu His Gly Lys Ala Cys 
70 75 80 



ggc ate tgc tec ctg cag ggc cgt gec get gtc ccc acc ttg get cac 342 
Gly lie Cys Ser Leu Gin Gly Arg Ala Ala Val Pro Thr Leu Ala His 
85 90 95 1O0 



ctg gec acc tea cct gca ggt agg aat cct aat acc aac ctg gag gee 390 
Leu Ala Thr Ser Pro Ala Gly Arg Asn Pro Asn Thr Asn Leu Glu Ala 
105 110 115 



ctg gaa gaa ttt aag aaa ttg gtg cag cgc aag gga etc teg gag gag 438 
Leu Glu Glu Phe Lys Lys Leu Val Gin Arg Lys Gly Leu Ser Glu Glu 
120 125 130 



gac att ttc atg ccc ctg cag acg gga age tgc gtt etc gaa cac 483 
Asp lie Phe Met Pro Leu Gin Thr Gly Ser Cys Val Leu Glu His 
135 140 145 



taggcagccc ccgggtctgc acctccagag cccaccctac caccagacac agagecegga 543 
ccacctggac ctaccctcca gccatgaccc ttccctgctc ccacccacct gactccaaat 603 
aaag 607 



<210> 8 

<211> 147 

<212> PRT 

<213> Homo sapiens 



?' !r j; .... t vifi*<^. 



WO 01/12806 PCT/FROO/02319 



<400> 8 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 
15 10 15 

Ser Phe Thr Leu Glu Glu Glu Asp Glu Gly Glu Ser Val His Pro Glu 
20 25 30 

Glu Asn Pro Asp Ala Glu Asp Gly Gly Ala Trp Gin lie Gin Arg Leu 
35 40 45 

Trp Gly Gin Glu Ala His lie Pro Ala Gly Ala Ala Arg Asp Gly Arg 
50 55 60 

Leu Arg Leu Leu Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Leu His 
65 70 75 80 

Gly Lys Ala Cys Gly lie Cys Ser Leu Gin Gly Arg Ala Ala Val Pro 



Leu Ser Glu Glu Asp He Phe Met Pro Leu Gin Thr Gly Ser Cys Val 
130 135 140 



<210> 9 
<211> 676 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43).. (552) 

<223> CDNA2098 (676) - conventional form (hOBPIIb-alpha ) 
<400> 9 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag atg aag acc ctg 54 

Met Lys Thr Leu 



ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 102 

Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 

5 10 15 20 

gag gag gag gat ate aca ggg acc tgg tac gtg aag gec atg gtg gtc 150 

Glu Glu Glu Asp He Thr Gly Thr Trp Tyr Val Lys Ala Met Val Val 

25 30 35 

gat aag gac ttt ccg gag gac agg agg ccc agg aag gtg tec cca gtg 198 

Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser Pro Val 
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aag gtg aca gcc ctg ggc ggt ggg aag ttg gaa gcc acg ttc acc ttc 246 
Lys Val Thr Ala Leu Gly Gly Gly Lys Leu Glu Ala Thr Phe Thr Phe 
55 60 65 

atg agg gag gat egg tgc ate cag aag aaa ate ctg atg egg aag acg 294 
Met Arg Glu Asp Arg Cys He Gin Lys Lys He Leu Met Arg Lys Thr 
70 75 60 

gag gag cct ggc aaa tac age gcc tat ggg ggc agg aag etc atg tac 342 
Glu Glu Pro Gly Lys Tyr Ser Ala Tyr Gly Gly Arg Lys Leu Met Tyr 
85 90 95 100 

ctg cag gag ctg ccc agg agg gac cac tac ate ttt tac tgc aaa gac 390 
Leu Gin Glu Leu Pro Arg Arg Asp His Tyr He Phe Tyr Cys Lys Asp 
105 110 115 

cag cac cat ggg ggc ctg etc cac atg gga aag ctt gtg ggt agg aat 438 
Gin His His Gly Gly Leu Leu His Met Gly Lys Leu Val Gly Arg Asn 
120 125 130 

tct gat acc aac egg gag gcc ctg gaa gaa ttt aag aaa ttg gtg cag 486 
Ser Asp Thr Asn Arg Glu Ala Leu Glu Glu Phe Lys Lys Leu Val Gin 
135 140 145 

cgc aag gga etc teg gag gag gac att ttc acg ccc ctg cag acg gga 534 
Arg Lys Gly Leu Ser Glu Glu Asp He Phe Thr Pro Leu Gin Thr Gly 
150 155 160 

age tgc gtt ccc gaa cac taggcagccc ccgggtctgc acctccagag 582 
Ser Cys Val Pro Glu His 
165 170 

cecaccctac caccagacac agagecegga ccacctggac ctaccctcca gccatgaccc 642 
ttccctgctc ccacccacct gactccaaat aaag 676 



<210> 10 
<211> 170 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 
15 10 15 

Ser Phe Thr Leu Glu Glu Glu Asp He Thr Gly Thr Trp Tyr Val Lys 
20 25 30 

Ala Met Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys 
35 40 45 

Val Ser Pro Val Lys Val Thr Ala Leu Gly Gly Gly Lys Leu Glu Ala 
50 55 60 

Thr Phe Thr Phe Met Arg Glu Asp Arg Cys He Gin Lys Lys He Leu 



WO 01/12806 



PCT/FR00/02319 



Met Arg Lys Thr Glu Glu Pro Gly Lys Tyr Ser Ala Tyr Gly Gly Arg 
85 90 95 

Lys Leu Met Tyr Leu Gin Glu Leu Pro Arg Arg Asp His Tyr He Phe 
100 105 no 

Tyr Cys Lys Asp Gin His His Gly Gly Leu Leu His Met Gly Lys Leu 
115 120 125 

Val Gly Arg Asn Ser Asp Thr Asn Arg Glu Ala Leu Glu Glu Phe Lys 
130 135 140 

Lys Leu Val Gin Arg Lys Gly Leu Ser Glu Glu Asp He Phe Thr Pro 
145 150 155 160 

Leu Gin Thr Gly Ser Cys Val Pro Glu His 
165 170 

<210> 11 

<211> 782 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(537) 

<223> CDNA2098 (782) - long form (hOBPIIb-beta ) 
<400> 11 

cgc cca gtg acc tgc cga ggt egg cag cac aga get ctg gag atg aag 
Arg Pro Val Thr Cys Arg Gly Arg Gin His Arg Ala Leu Glu Met Lys 
15 10 15 

acc ctg ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc 
Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe 
20 25 30 

acc ctg gag gag gag gat ate aca ggg acc tgg tac gtg aag gec atg 
Thr Leu Glu Glu Glu Asp He Thr Gly Thr Trp Tyr Val Lys Ala Met 
35 40 45 

gtg gtc gat aag gac ttt ccg gag gac agg agg ccc agg aag gtg tec 
Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg Pro Arg Lys Val Ser 
50 55 60 

cca gtg aag gtg aca gec ctg ggc ggt ggg aag ttg gaa gec acg ttc 
Pro Val Lys Val Thr Ala Leu Gly Gly Gly Lys Leu Glu Ala Thr Phe 
65 70 75 80 

acc ttc atg agg gag gat egg tgc ate cag aag aaa ate ctg atg egg 
Thr Phe Met Arg Glu Asp Arg Cys He Gin Lys Lys He Leu Met Arg 
85 90 95 

aag acg gag gag cct ggc aaa tac age gec tgc ttg tec gca gtc gag 
Lys Thr Glu Glu Pro Gly Lys Tyr Ser Ala Cys Leu Ser Ala Val Glu 
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atg gac eag ate acg cet gec etc Ugg gag gec eta gec att gac aca 38 4 
Met Asp Gin lie Thr Pro Ala Leu Trp Glu Ala Leu Ala lie Asp Thr 
115 120 125 

ttg agg aag ctg agg att ggg aca agg agg cca agg att aga tgg ggg 432 
Leu Arg Lys Leu Arg lie Gly Thr Arg Arg Pro Arg lie Arg Trp Gly 
130 135 140 

cag gaa get cat gta cct gcagga get gec cag gag gga cca eta cat 480 
Gin Glu Ala His Val Pro Ala Gly Ala Ala Gin Glu Gly Pro Leu His 
145 150 15S 160 

etc tta ctg caa aga cca gca cca tgg ggg cct get cca cat ggg aaa 52 8 
Leu Leu Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Pro His Gly Lys 
165 170 175 

get tgt ggg taggaattct gataccaacc gggaggcect ggaagaattt 577 
Ala Cys Gly 

aagaaattgg tgcagcgcaa gggactctcg gaggaggaca ttttcacgcc cctgcagacg 637 

ggaagctgcg ttcccgaaca ctaggcagcc cccgggtctg cacctccaga gcccacccta 697 

ccaccagaca cagagcccgg accaectgga cctacectcc agccatgacc cttccctgct 757 

cccacccacc tgactccaaa caaag 762 

<210> 12 - 

<211> 179 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Arg Pro Val Thr Cys Arg Gly Arg Gin His Acg Ala Leu Glu Met Lys 
1 5 10 IS 

Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe 

20 25 30 

Thr Leu Glu Glu Glu Asp lie Thr Gly Thr Trp Tyr Val Lys Ala Met 
35 40 45 

Val Val Asp Lys Asp Phe Fro Glu Asp Arg Arg Pro Arg Lys Val Ser 
SO 55 60 

Pro Val Lys Val Thr Ala Leu Gly Gly Gly Lys Leu Glu Ala Thr Phe 
65 70 75 90 

Thr Phe: Met Arg Glu Asp Arg Cys He Gin Ly3 Lys lie Leu Met Arg 
85 90 95 

Lys Thr Glu Glu Pro Gly Lys Tyr Ser Ala Cys Leu Ser Ala Val Glu 
100 105 110 

Met Asp Gin He Thr Pro Ala Leu Trp Glu Ala Leu Ala He Asp Thr 



WO 01/12806 



PCT/FR00/02319 



12 



115 120 125 

Leu Arg Lys Leu Arg lie Gly Thr Arg Arg Pro Arg lie Arg Trp Gly 
130 135 140 

Gin Glu Ala His Val Pro Ala Gly Ala Ala Gin Glu Gly Pro Leu His 
145 150 155 160 

Leu Leu Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Pro His Gly Lys 
165 170 175 

Ala Cys Gly 



<210> 13 

<211> 542 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (43). .(297) 

<223> CDNA2098 (542) - short form ( OBPI lb-gamma ) 
<400> 13 

cgcccagtga cctgccgagg tcggcagcac agagctctgg ag atg aag acc ctg 54 

Met Lys Thr Leu 



ttc ctg ggt gtc acg etc ggc ctg gec get gec ctg tec ttc acc ctg 102 
Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu Ser Phe Thr Leu 
5 10 15 20 

gag gag gag gat gag gga gga teg gtg cat cca gaa gaa aat cct gat 150 
Glu Glu Glu Asp Glu Gly Gly Ser Val His Pro Glu Glu Asn Pro Asp 
25 30 35 

gcg gaa gac gga gga gec tgg caa att cag cgc eta tgg ggg cag gaa 198 
Ala Glu Asp Gly Gly Ala Trp Gin lie Gin Arg Leu Trp Gly Gin Glu 
40 45 50 

get cat ata cct gca gga get gee cag gag gga cca eta cat ctt tta 246 
Ala His lie Pro Ala Gly Ala Ala Gin Glu Gly Pro Leu His Leu Leu 
55 60 65 

ctg caa aga cca gca cca tgg ggg cct get cca cat ggg aaa get tgt 294 
Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Pro His Gly Lys Ala Cys 
70 75 80 

ggg taggaattct gataccaacc gggaggcect ggaagaattt aagaaattgg 347 
Gly 
85 

tgcagcgcaa gggactctcg gaggaggaca ttttcacgcc cctgcagacg ggaagctgcg 407 
ttcccgaaca ctaggcagcc cccgggtctg cacctccaga gcccacccta ccaccagaca 4 67 
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cagagcccgg accacctgga cctaccctcc agccatgacc cttccctgct cccacccacc 527 
tgactccaaa taaag 542 



<210> 14 
<211> 85 
<212> PRT 

<213> Homo sapiens 
<400> 14 

Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala Leu 
15 10 15 

Ser Phe Thr Leu Glu Glu Glu Asp Glu Gly Gly Ser Val His Pro Glu 
20 25 30 

Glu Asn Pro Asp Ala Glu Asp Gly Gly Ala Trp Gin He Gin Arg Leu 
35 40 45 

Trp Gly Gin Glu Ala His He Pro Ala Gly Ala Ala Gin Glu Gly Pro 
50 55 60 

Leu His Leu Leu Leu Gin Arg Pro Ala Pro Trp Gly Pro Ala Pro His 
65 70 75 80 

Gly Lys Ala Cys Gly 
85 



<210> 15 
<211> 10664 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> hOBPIIaGcnc 
<400> 15 

ctctgcttct atattcggcc tccaaatctg tgctcaatgc agcaactgga gtgacccctt 60 
aaatacgtaa gtcacagctt gcctttgtca gagctatcca gggtctttca ctcagagcag 120 
aagctgaagt cctcgtggtg gtccttaatc cctacatggc cgttccaccc actccccagc 180 
ctcatgtgtg gccggtctcc ctggatcatt tgctgtggct gctctgccgt gtgttcccgg 240 
aaactgccag cgttctccca cctctgggct ggcactggat gctcctgcca cctggactcc 300 
tcttccagct gacgagctca tggcttgctt ccttcatgtc ttaaattcgg tgtttgaatg 360 
ccaccttggc gaggctgttc ctcatcaatt catgtaaagg acaaataacc ccttgtctgg 420 
ccactcctgt ccaacttgtc cactttgctt tttccatagc actgatcacc atttaaaata 480 
atgtatgaac cagtgaccca tcagtgacca gagacagtag attgaacagg tattgcttgc 540 
acagtgaaca atcagctcta gaggtttaaa gagaaccaca aagaatggct caggattgca 600 
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cgagggcagt taaggaagaa ataaacaggg gtggggatcc tttccacaag tgtgggttca 660 
gaccgcatgg gagaaggtgt ggttctccca agggaagctg gagaagtttg ctgggttgtc 720 
ccagccacag ctggcccacg gtcagggcag aggccagcag gcagggaaat gtaggttggg 780 
tctggcaagc aggaagcctc tcttcccccg caaggcaggt gggctggggc tgggaagcct 840 
acaagagtca ctgggaaccc acaggtgcag atgccactga atctcaatag gaagccatct 900 
gggggggtcc cctgaattca atgtggtgtg tgccgccaga cgtccccaac ttgtgccact 960 
gccatttata caggaagaga aagaaaaagg aagaaatgga agcatctgga accagtcatc 1020 
ctggacccat gctaggaggt gcttccccct acgcctcaac caacaaagct tagcattgca 1080 
ccaggtgcac aagagaagcg cctgctggct ccagctccat tgcctcggaa ccagccatga 1140 
agggtgcgtg tggagctgga ggcaagacat tgatagctgg cactgcaatt cacttattta 1200 
ttttgttcat tttaagtccc ctgcacctag aatataagcc ccccaagcac aggacatttg 1260 
ttttattgat cgatgtattc cttgtgcccc aaagaatgag aggcatctag aaagtctgca 1320 
aaaatcaaac ataaaaatga acctttattc agtcattgtt attttgatgg atatttgagg 1380 
catttccaag atttgtaagt aacaattgaa cccttctcct ggttcatgtg tgggagtgta 1440 
tctgttcaaa tgatgcttct gagtggagtt gctgagtctt tggctctagg tttttttttt 1500 
ttaagcattt atgctcattg tggtttttaa attaaacatt taaccctgag acactgtaga 1560 
ttcccatgca attgtaaaaa gccacacaga gctatcatgt gtatcttcac ctggcttgct 1620 
ccagccccaa ccccagcaat gacagacctg ttctccactc ctgcaatctg ctcattgcaa 1680 
gaatgtcgtc tattgcaatc ataaaattgt gggattggct ttttttttcc tgtgcagcat 1740 
cattctctgg agattcatcc tattgttgca tttatcaata gtttattcca ttttacttct 1800 
gagtagtgct ctatggtatg gatgtaccac agtctgttta accattcacc tgttggagga 1860 
agtctgtgtt tatagatttg ggctatgaca catgtatagg tttttgcatg gacatcagtt 1920 
ttcatttccc tgggacaaag gcccaggggt tctattgctg gattctatgc ttgttacagg 1980 
gctcattttg ttttgttttg ctttgtttta ttttgttttt aacctgtcaa gccattttcc 2040 
agatccagtt tccatgcatc ctcaccaggc ttcagtatga tcactatgat cttatctcag 2100 
ccaccttaat aggtatgtac tgatatatca tggcttttat ttgcatttca ctgatgacta 2160 
atggtgttga gcatcttttc atgtgtttat ttgccatctg tatatcctct ctagtcaagt 2220 
gtcccttcat gtcttttgtt tacgttctat ttttgaaact gttgagtttt gaaaaattct 2280 
ttataaagta tagaaactaa ttctttgttg aatatgtagt ttgtcaatat tttctttcag 2340 
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tctttggctt gtctttttat tctgttaaca gggtctctta cagagcaaaa ggtttttatt 2400 
ttgatgaagt ctattttaac aatttttcct tttatggatc atatttttgg caacaaatct 2460 
aaaacctcct gacccagctc catatgtcaa agattttctt gttttctaaa agttttatag 2520 
ttttaagttt tatgttgaag tctatgatcc attttgagtt aattttcata cagggtgtga 2580 
gaccaaggtt gtggttcttc tttttctttg gLggttttgt ctgtggatgt ccagttgctc 2640 
cagcccattt gctacaaaag ctatctttcc tccactgaat tacttttgca tctttgtaaa 2700 
aatgtaattg ggtgtatttg tacaggtctg tttgaggatt ctttattgtc tgtcccattg 2760 
atctatgcat ctgtccatgt gctagctata taagtcttga aagggtagcc tgtagctggg 2820 
tgtggtggcg agagcctgta atcccagcta ttcgggaagt ggaggcagga gaatcgattg 2B80 
aacccaggag gtggaggttg cagtgagtca agatcgtgcc actgcactcc agcctgggtg 2940 
acagagtgaa ctccatataa aaaaataaaa acaaaaataa agtagcgtga tttctcccac 3000 
cttattcttt tttaaaaaag ttttagctat tccagttcct ttgcctttcc atataaattt 3060 
tagaataatc ttgtctatat ctaccacaat tctttctgga attttgatag aaattttgtt 3120 
acatctttat attatttgag agaaatgata tttttactat gttgagtctt ctaatccatg 3180 
gatataacgt ctctccattt gtttagatct tctttgattt attttataat cattgcattg 3240 
ttttcagcat acaaatccag catatgggtt gttagactta tgcctaggca tctcattttt 3300 
ttagccatta taaatagtac tgtgttttta agtttagggt ccattactag tatgtagaca 3360 
cacaattgat ctttgtatat ttatcttgtg tcctgcaacc ttgctgaact cacttactag 3420 
ttctaggagg tgtattgttt tcttttgttt tgtttttcaa tttcttggga ttttctacag 3460 
agataataat gtcatctgca gatgcagttt tcttccttcc tttccaattt gtatgtcttt 3540 
aatttccttt ttaaaaaacc tatattactc tgactagaac tttctgtact atgttaaata 3600 
caagtggtag agtggacatc cttgccttgt ccctgatgtt aaagagaaag catttgtaac 3660 
tgagtatccc agcctcaaaa tgtgcttaaa aacttttttc ctttcttgct ttcagccttg 3720 
aaacatactt cgaaactctt tatttctccc tttcccacca ggcacttccg tgagcagtgc 3780 
tcgcttatct aattatgtgc ttacttagaa attccagggg ccaattttga aacaaaccag 3840 
gcagagagac ccagctgcag aatcctccct cttaggggga gttacaggta gcctaccact 3900 
tcccggctga aatcaggatg acgcaaacca gacctccgga cagacgattg atgactcaca 3960 
ataaccatca gaacaagatg cagaccaaca tcctcctgca ccattcccac atatttccca 4020 
caccttttcc tccttaaacc ccttcgctca gtccagaaaa tctgaatggt cttttaaagg 4080 
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catgggtctg gccattcccc aactgccagt atttgaataa agctgctttc cttttaccac 4140 
acctcacttc tcatgccttg acttctgagc agcgagcagc tggacttgag ccagttacac 4200 
atcagtcttt caccattaga aataatgtag ccataggttt ttttttcgta gatgttcttt 4260 
ttcaagttaa agaagttctc ttctattcct atttttctga gaggtgttat cccgaatgag 4320 
tgttgaattt tgttaaatac ttttaacaac ccaacaggaa ccaccatcag aagccatcca 4360 
gagaaacgca ccaggccaca aagcactggg ggccagggat cttgcccctg ctgtctgcca 4440 
tgggttgacc cccagcctcc aaccctacca tcccctgacg gtgtctgcag cagttgaacc 4500 
caaccagcat ctaaaagaac acagttggtg aacgagactg ggacacaggg caagatgggt 4S60 
ggacaatggg aggctcctgg agagcacccg taccagcgag catagaattc atgggggtga 4 620 
cctgttecct gaagcatctg cgcgtgttgt tccagcattt tcttcaagga ttgagccagc 4 680 
agcaccagtg tcatacggtg cttaaatcaa tgattcacag ccaaccaatg aaatacaagg 4740 
tgccggctgg gcgcggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcggg 4 800 
cggatcacga ggtcaggaga tcgagaccat cccggctaaa acggtgaaac cccgtctcta 4 860 
ctaaaaatac aaaaaaaatt agccgggcgt agtggcgggc gcctgtagtc ccagctactt 4 920 
gggaggctga ggcaggagaa tggcgtgaac ccgggaggcg gagcttgcag tgagccaaga 4 980 
tcccgccact gcactccagc ctgggcgaca gagcgagact ccgtctcaaa aaaaaaaaaa 5040 
aaaaaaaaaa aaaagaaata caaggtgcct ggggtacagg caacaaaaat ggggaacgga 5100 
gaaatcttcc cgttgaggta gagactccag cctgtttgct catctctggt cctgaagagc 5160 
ccagtccccg ccctaaggat ggggtttctg ctcggcagca cttgccgtga gaggggtgag 5220 
gcactgggtc acgccagccc tttctttata gcccccaggg tttattagag tgtgcattag 5280 
tattatttag caagcacttg agggtgtctg atcttgggcc agagaggcac aaagattgtg 534 0 
tgggcccagc acctgcctgc agagcgtggg tcagccttgg gtcccagggc agatgacgca 5400 
ggcccgggaa ctacagggtc caggagggag aaggcaaagt tctggctcag gttggctggg 5460 
gatgaggcca gcggagccag gtgcccaagg gagctcagcc acaaactctg agcacaggct 5520 
ggcaggtggc tcttgatgct catgccaccc atttatctaa agggatgaga ttcaaggcct 5580 
gtcctggtgg ctgggggccg tcgaagctga cgagagaggg gatgtagagt gaatgtatat 5640 
tccactctac cacttgtatt taacagggag atggatgatg aacacgtgca ggaggaaaca 5700 
ggcaggacaa tccagagaga tcacgtgttc tgaggacagc acagccaggc tccggtacgg 5760 
agtgaagcgg ggtggggcag gcggcggggt ccctcatatg gcccgaggag gccgtatata 5820 
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tactgacctt gagccacaca atagtgccct tctctgcccc taggaagctc gagtgcaggg 5880 
tccaggtggg gaaaatcaat gcagagtggg tcccagagtg ggcggaagct tgggctctag 5940 
ggcgtgcggg actcagctgg cagcagcccc acatctctat ggttctaaag cccagtccat 6000 
ttctgctcag cagggagatg gccagttccc cagaggactt gcccagggtc ccagccgtgq 6060 
ccctgggagg gctcaggggt gagggaggga gaattccgag ccgtgggccc tccctctgtg 6120 
gctcggggtg cagacgtgca ctaccctgcc ctgtcctctg gagtcctccg ggctcatcca 6180 
agtgggcgca tttgggtgca aaccctggac agcatcgtgt gtcttttctt ctctggctgg 6240 
cactgaattg ctgttcacca gatgccagca atgaggacac tccccctccc ggatgcagga 6300 
ctggttgcca ccatggggag gggtgcaccg tgccacgtgc tcccaggaca ctggccaggg 6360 
ggctataaag aacatctcga gaggagccag cacagccttg ttcagacgcc cagtgacctg 6420 
ccgaggtcgg cagcacagag ctctggagat gaagaccctg ttcctgggtg tcacgctcgg 6480 
cctggccgct gccctgtcct tcaccctgga ggaggaggat gtgagctggg ttggcgtggg 6540 
cggatggagg agccaggtgg actcctgggc agggggcagt gccaggggcc ctgctttagg 6600 
aggtgteact taagcctggg gtggtggtgg aggggtccta ttgttccttc agcagacaat 6660 
gctccccatg aggcccaggg ccggagcagg ctcggctcag gggttcctgc tgcactgacg 6720 
cctgaagccc gaaggtctcg cagggttggg ccctgtggag ggagggctca cctggtgctg 6780 
gggcccgggg gtccatgggg tgcagacatg ccctccttcc actgggggct gggagccctg 6840 
agcagggggc tggctctaac tcactccagc tgagctctaa ctaaggtgca ggaacccagc 6900 
ctgcctttag gggtggcagc cgggcaccat gggtgtctgg ttatagctgc aggcctgagt 6960 
gccagggtca gagtagaatc tgggccaccc atggtgggct cacggccttg gcctgctcca 7020 
gatcacaggg acctggtacg tgaaggccat ggtggtcgat aaggactttc cggaggacag 7080 
gaggcccagg aaggtgtccc cagtgaaggt gacagccctg ggcggtggga acttggaagc 7140 
cacgttcacc ttcatgtgag tgttgcccac tgcagggccc ctcaggccac tttcgctccc 7200 
cgccccagac ccacctggtg cccattgccc catccacatt tcgggtgttg ggaagagtca 7260 
ccccctgcct tggagggaaa cagccagggc atcctgaagc tcggtggggt gggggggcag 7320 
tggaattttc aggttgccgg gtcagggcca tgcaccaggt gagctgagga tgggccaggt 7380 
gtgtcctggg agccgctgcc cgcgtgtctc ctgttttcca ggagggagga tcggtgcatc 7440 
cagaagaaaa tcctgatgcg gaagacggag gagcctggca aatacagcgc ctgtgagccc 7500 
ctccccgacc ccccactccc catgcccaac cccggatgca ccagccccac tgcaggtgga 7560 
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gagtgcccag gccacacttc tgccagggtc ccagccctgc ccacctccaa ggaggggctg 7620 
gcctctcctt cctggggggc tggtggccct gacatcagac accaggtgtg acaggcttgt 7680 
ccacagtaga gatggaccag atcaagcctg ccctctggga ggccctagcc attgacacat 7740 
tgaggaatcc gagtgttagg gaccaggagg ccgagggtta gggatgggaa gccaaggctg 7600 
agggtttggg atcaggaatc cgagggttag ggacagggaa gtggggcagg agcagctgct 7860 
ggagctggga aggccggact ctagtcctgg acgtgctctg gccttgtggc tccattactt 7920 
gcattgggac cttccgagag gaggctcctg cctccgtgtc cgggtccatg ctgtgcggag 7980 
cagccaggcc tggctcaggc tgtccagggc acctgggtga ccactgaaac attcctgagt 804 0 
gtttcttcgt gtggtcctga gtgctctctc cgggaatgag ggcactgaag acccatcttc 8100 
tctgtcatct acagatgggg gcaggaagct catatacctg caggagctgc ccgggacgga 8160 
cgactacgtc ttttactgca aagaccagcg ccgtgggggc ctgcgctaca tgggaaagct 8220 
tgtgggtgag gggcccgctg gggcctgcat gtcctgcccc atggtctctg cctccagaag 8280 
ccagtggaac caccatcatc acgccctggc acggggggaa aaggaagccc cctgcgccgg 834 0 
ccttcgtgtg ctaggcacca agcgctgccc tggatggctg gtccaagttc ctgaagtggg 8400 
agtggggtgg gccaggcagg gacagacacg gccctcggtg acgtgaacct gccaagggcc 84 60 
gcttgtgggg tctcaggtgt aggggcctca ccttaagggg gaggtancat cttaacagag 8520 
ctcttcatgg ggcagggact ctccagggcg gcagggcagc cagtgcctct gggacacaag 8580 
gtccctccag gtgagggttg tgaccctgca gagtggcttt gggagctgcc caggtccccc 8640 
tggggttgct gagtggcttg gaccctgcca ctgtcccctt tcctggggac ctctcacctg 8700 
ggcggtggcc gtctcctctg tccccagtcc cacccctgag ctcttgtcca ttctcaggcc 8760 
tcctctcccc cttgcctggt gctggacagt tgccatctct tctgtcccca gccccacccc 8820 
tgagctctga tccactctcg ggcctctccc ccgtcctgat gctgggccgt ggtcgtctcc 8880 
ttttagcatc tgctccctgc agggccgtgc cgctgtcccc acgtcggctc acctggccac 8940 
ctcacctgca ggtaggaatc ctaataccaa cctggaggcc ctggaagaat ttaagaaatt 9000 
ggtgcagcac aagggactct cggaggagga cattttcacg cccctgcaga cgggtgagga 9060 
cggctgtgcc cagtaccccg tgttcccctg tgtctctgtg tgatctccag tgtcccatga 9120 
ccctcgtgtc ctcccatgtc ccccgcattc cccatgtgcc ccgagtctcc tcgcaggggc 9180 
tctgggccct gcttagcatc ctcgtcgttg gagggtctgc actctgggct gcgatggggt 924 0 
ctggggctcc gcgctctggg ctgcgatgcg gtctggggct ccgcgctctg ggctgcgatg 9300 
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gggtctgggg ctccgcgctc tgggctgcga tggggtctgg ggctccgcgc tctgggctgc 9360 
gatggggtct ggggctccgc gctctgggct gcgatggggt ctggggctcc gcgctctggg 9420 
ctgcgatggg gtctggggct ccgcgctctg ggctgcgatg gggtctgggg ctccgcgctc 9480 
tgggctgcga tggggtctgg ggctccgcgc tctgggctgc gatggggtct ggggctctga 9540 
gctctgggct gcgatggggt ctgggccctg gtctagggcg ccttctaatc cctgggtttt 9600 
tcttggtctc tgcaggaagc tgcgttctcg aacactaggg tgagtgagcc tttaggaggg 9660 
cactggacaa gcccagagtc ctgggttccc ggggttcgag ggtacatctg ctctggccct 9720 
tcccatccca cacagccagg gagacccccc cagggtcagg cacgaggttg gcacctcaga 9780 
gtctgcccac ccaaaattcc tgggacattc gggaagtcct ttgttttacc attcctgcac 9840 
ctgccagccc gagtgagggt cctcctcggc ctttccacag cgaggcctcc ctccggctcc 9900 
ctcaggtgtc agctcggccc atcgccccct gcacctgtcc ccactcggtc tactcccccc 9960 
cacccactca ccaaggatgc tcagaggcct gcccagtcat tggaggagct gaggctgctc 10020 
tggagccccg aggctgccca gcagtggccg atgtggaagc tgcagagcct ggggagggag 10080 
ctctgggcct ggcctctgcc ctaccctgca gcctccctga cctctgctcc tcttcccagc 10140 
acagcagccc ccgggtctgc acctccagag cccaccctac caccagacac agagcccgga 10200 
ccacctggac ctaccctcca gccatgaccc ttccctgctc ccacccacct gactccaaat 10260 
aaagagcttc tcccccagct ctgggcaggc ctatctgtgg ggacggaggg gctcgcaccg 10320 
gctccctggg aggcctgctg ggagggggag ccacaaggga agctggggag atgttggacc 10380 
tagcaagggc caaaccacaa gaggcacgat gtccaacagg ctgtggggct gcgtgatgtc 10440 
ctggaggggc ctccggaaga gccgccctcg gatctcaggc tcagcttggg acgggcaggg 10500 
ctctgggcag gagcacttgg gaagccactg ggagggccgg agtggggaca cggcgtcaga 10560 
cctgcatcag tgggtccacc acagggctcc cgccccgggg gtctcagtgg gatcctccga 10620 
gcgggtccac tttancatcc tgngctcagt ggcatcactg gata 10664 

<210> 16 

<211> 13591 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> hOBPIIbGcnc 
<400> 16 

tctctgcttc tatattcggc ctccaaatct gtgctcaatg cagcaactgg agtgacccct 60 



WO 01/12806 PCT/FR00/02319 



taaatacgta agtcacagct tgcctttgtc agagctctcc agggtctttc actcagagca 120 
gaagctgaag tcctcgtggt ggtccttaat ccctacatgg cagttccacc cactccccag 180 
cctcatgtgt ggccggtctc cctggatcat ttgctgtggc tgctctgctg tgtgttcccg 240 
gaaactgcca gcattctccc acctccaggc tggcactgga tgctcctgcc acctggactc 300 
ctcttccagc tgacgagctc atggcttgct tccttcatgt cttaaattcg gtgtttgaat 360 
gccaccttgg cggggctgtt cctcatcaat tcatttaaag gacaaataac cccttgtcct 420 
gacactcctg tccaacttgt ccactttgct ttttccatag cactgatcac catttaaaat 480 
aacgtatgaa ccagtgaccc atcaggatcc agagacaata gattgaacag ggattgcttg 540 
cacaatgaac aatcagctct agaggtttaa agagaaccac aaagaatggc tcaggattgc 600 
acgagggcag ttaagganga aataaacagg ggtggggatc ctttccacaa gtctgggttc 660 
agacctcatg ggagaaggtg tggttctccc aagggaagct ggagaagttt gctgggttgt 720 
cccagccaca gctggcccac ggtcagggca gaggccagca ggcagggaaa tgtaggctgg 780 
gtctggcaag caggaagcct ctctccccac ccaaggcagg tgggctgggg ctgggaagcc 840 
tgcaagagtc actgagaacc cacaggtgca gatgccactg aatctcaata ggaagccatc 900 
tgggggcggt cccctgaatt caatgtggtg tgtgccgcca gacgtccccg acttgtgcca 960 
ctgccatttg cataggaaga gaaagaaaaa ggaagaaatg gaagcatctg gaaccagtca 1020 
tcctggaccc atgctaggag gcgcttctcc ctacgcctca accaacaaag cttagcattg 1080 
caccaggtgc acaagagaag cgcctgcttg ctccagctcc attgcctcgg aaccggccat 1140 
gaagggtacg tgtggagctg gaggcaagac attgatagct ggcactgcaa ttcacttatt 1200 
tattgtgttc attttaagtc ccctgcacct agaatataag ccccccgagc acaggacatt 1260 
tgtttcattg atcgatgtat tccttgtgcc ccaaagaatg agaggcatct agaaagtctg 1320 
caaaaatcaa acataaaaat gaacctttat tcagtcattg ttattttgat ggaaatttga 1380 
ggcatttcca agatttgtaa gtaacaattg aacccttctg ctggttcatg tgtgggagtg 1440 
tatctgttga aatgatgctt ctgagtggag ttgctgagtc tttggctcta ggtttttttt 1500 
ttttaagcat ttatgcttat tgtggttttt aaattaaaca tttaaccctg agacattgta 1560 
gattcccatg cagttgtaaa aagccacaca gagctatcat gtgtatcttc acctggcttg 1620 
ctccagcccc aaccccagca atgacagacc tgttctccac tcctgaaatc tgctcattgc 1680 
aagaatgtcg tctattgcaa tcataaaatt gtgggattgg cttttttttt tcctgtgcag 1740 
catcattctc tggagattca tcccattgtt gcatttatca atagtttatt ccattttact 1800 
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tctgagtagt gctctatggt atggatgtac 
ggatgtctgt gtttatagat ttgggctatg 
gttttcattt ttctgggaca aaggcccagg 
agggtttttt tttttaaacc tgtcaagcca 
cctcaccagg cttcagtatg atcactatga 
ctgatatatc atggctttta tttgcatttc 
catgtgttta tttgccatct gtatatcctc 
ttacattcta tttttgaaac tgttgagttt 
attctttgtt gaatatgtag tttgtcaata 
ttctgttaac agggtctctt acagagcaaa 
caatttttcc ttttatggat catatttttg 
ccatatccca aagattttct ttctgttttc 
taagtctatg atccattttg agttaatttt 
cttctttttg tttggtggtt ttgtctgtgg 
aaagctatct tccattgaat tgcttttgca 
tgcaggtctg tttgaggatt ctttattgtc 
gctagctata taatgagtct tgaaagggta 
gtaatcccag ctacttggga ggcggaggca 
ttgcaatgag tcgagattgt gctactgcac 
ctctaaaaaa aagaaagaag aagaaaagga 
attcttaaaa aaaaaagttt tagttattct 
aataatcttg tctatgtcta caaaaatccc 
tctttatatt atttgagaga aatgaegctt 
atagcacatc tctccatttg tttagatctt 
tttcagcata caaatccagc atatgggttg 
tagccactat aagtagtatt gtgtttttaa 
acaattgatc tttgtatatt tatcttgtat 
tctaggaggt gtattgtttt cttttgtttt 
gacaatcatg tcatctgcag atgcagacag 



cacagtctgt ttaactattc acctgttgga 1860 
acacatgtac aggtttttgc atggacatca 1920 
ggttctattg cggggttcta tgcttgttgc 1980 
ttttccagaa actgtccagt ttccatgcat 2040 
tcttatctca gccaccttaa taggtatgta 2100 
actgatgact aatggtgttg agcatctttt 2160 
tctagtcaag tgtcccttca tgtcttttgt 2220 
tgaaaaattc tttataaagt atagaaacta 2280 
ttttctttca gtctttggct tgtcttttta 2340 
aggtttttat tttgatgaag tctattttaa 2400 
gtgacacatc taaaatctcc tgacccagct 2460 
taaagctttt atagttttaa gttttatgtt 2520 
cttgcagggt gtgagaccaa ggttgtggtt 2580 
atgtccagtt aatccagccc atttgctaca 2640 
tctttgtaaa aacttaactg ggtatatgcg 2700 
tgtcccattg atctatgcat ctgtccatct 2760 
gcctgtagct gggtatggtg gtgtgcatct 2820 
ggagaatcac ttgaatctgg gaggtggagg 2880 
tccagcctgg gtgacagagc aagactctgt 2940 
aagaaagggt agcctgattc ctcccacttt 3000 
agttcctttg cctttccata taaatgttag 3060 
ttctggaatt ttgatagaaa ttgtgttaaa 3120 
ttattatgtt gagtctccta atccatggat 3180 
ctttgattta ttttataatc attgcattat 3240 
ttaaacttat gcctaggtat ctcttttttt 3300 
gtttagggtc cgttactagt atgtagacac 3360 
cctgcaacct tgctgaactc gcttaatagt 3420 
gtttttcagt Ctcttgggat tttctacaga 3480 
tcttcttcct tcctttccaa tttgtatgtc 3540 
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tttaatttcc tttttaaaaa acctttattg ctctgactag aactttctgt actatgttaa 3600 
atacaagtgg tgaaagtgga catccttgcc ttgtccctga tgttaaggag aaagcgtttg 3660 
taactgagta tcccagcctc aaaatgtgct taaaaacttt tttcctttct tgctttcagc 3720 
cttgaaacat acttcgaaac tctttatttc tccctttccc accaggcact tctgtgagca 3780 
gtgctcgctt atctaattat gtgcttactt agaaattcca ggggccaatt ttgaaacaaa 3840 
ccaggcagag agacccagct gcagaatcct ccctcttagg gggagttaca ggtagcctac 3900 
cacttcccgg ctgaaatcag gatgactcaa accagacctc tggacagacg attaatgact 3960 
catgataacc attggaacaa gatgcagacc aacatcctcc tgcaccattc ccacatattt 4020 
cccacacctt ttcctcctta aaccccttcg ctcagtccag aaagtttgaa tggtctttta 4080 
aaggcatggg gcctggccat tcctctactg ctagcatttg aataatgctg ctttcctttc 4140 
accacacttc acttctcatg ccttgacttc tgagcagcga gcagctggac ttgagccagt 4200 
tacacatcag tctttcacca tgagaaatga tgttagccat aggttttttg tagatgctct 4260 
ttgtcaagtt aaggaggttc tcttctattc ctacttttct gagagttgtt ttcctgaatg 4320 
cgtgatgaat ttagtcaagt actttttctg cattgatcga tatgatcatg taatttttct 4380 
tcttaacata ttaatatggt tgatttttga gtattgtacc aggtttccat ccctggaata 4440 
aactgctttt ggtcatggtg tagaattaat tttatatatt attatttgct aatattttgt 4500 
aaaggatttc tgtatctata ttcatgaggt atattgttct gcagctttcc tttttgtgtt 4560 
gtcaggtttt ggttgggaga tattctctca tcttctgaga ttatgcagag ttggtgttat 4620 
ttatttttta aatggttggt agaattttct aatgaaacca tatggacatg aagaattatt 4680 
tttggaagct tttaaaatta aaatgttaac tttaattgtc atagagctat taaagttatc 4740 
tatcttatat tggcattgtg ttttttgaga aattggttca tttcacctat gttgtcaaat 4800 
atatgtggga aggtctcttc gtagtattcc cttattatcc tttgatgtct gcagggtctg 4860 
tagtgatagt ctctgttttg ttccagatgt tggttatttg tgtctttctt tctttttttg 4920 
tttgtcagtc tttctagaac tgcctcaatt ttattaattt ttccagagaa ctaacccttg 4980 
tttcattgtt tttctctgtt gtttttctgc tttcgatttt ttctgctctc tgcttttgat 5040 
ttattaattg ctgctcttat ctttgttatt tcttttcttc ttgctttggg tttattttgc 5100 
tcttcttttt ctaggttctt gaagtgggag cttagatagt tgattttgga cttctctttt 5160 
ctaatatatg catttaatgc ctctcagcac tgatttagct gtgtctcaca aattttgata 5220 
tgtttttgtt tgtttgtttt tgaggtagag tctcgctctg tatcccaggc tggagtgcag 5280 
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tggcactatc ttggctcact gcaacctcca cctcctgggt tgaagtaatt ctcctgcttc 5340 
agccccctaa gtagctggga ttacaggtat acgccaccat gcccagcaat ttatttattt 54 00 
atttattttt gtatttttag tagagatgag gttttgtcat gctgtccagg ctggtctcga 5460 
actcctgacc tcaagtgatc tgcccacgtg gggctccaaa gcactgagat tataggcata 5520 
agccactgtg cccagcetgt ttttgttttt tttgagatgg agtcttgctc tgtcacccag 5580 
gctggagtgc agtggcatga tagcccactg caacctccgc ctcccgggtt caagtgattc 5640 
tcctgcctca gcctcccgat tagctgggat tacaggcgtg caccaccaca ctcagctagt 5700 
ttttgcattt ttggtagaga tgggttttcg ccatgttggc caggctggtc tcgaattcct 5760 
gatcgcaagt gatccacccg cctgggcctc ccaaagtgct ggaattatag gcacaagcca 5820 
tcacgcccag ctgatatgtt gtattttcat tttcatccag cctagtatat ttttaaattt 5880 
accttcaggc ttcctctttg acctgtggat tattcacagg tgtgttgttt ccaagcatct 5940 
gaagaaagat tttccattat ctttctggca ttgatttata gttagattcc atgtggtcag 6000 
agactaccct ctatatcatt taaattttta aaaatttctt gaggctcatt ttctcatcca 6060 
ggatatggtc tatcttggta Latattctgg tgcacttgaa aagaatgtgt gtagtgctgt 6120 
tgccaggtgg tgtgttctac aaatgtcaat tcgattatgt taattgatgg gtggccgagt 6180 
tctttgatgt ccctgctggt tttctgtcta gttgctctga gagaagggta ttgaagtctc 6240 
caactataat tgtggctttt taaatttctc ctttcagttc tgttttgctt cacatattgt 63O0 
gcagctaatg tttggtgcac acacgtttag gattactatg tcttctttgc agaatggccc 6360 
acttattatt gtataatttc cctctctgtt aattagtctt tgttctaaag tctattttac 6420 
ctgttattga aataatgctt ctgctttctt ttgattaatg tttatatatc ttttttcatg 6480 
tttttacttt caacctgtct ctattgttac atttgaagta agtttcttgt agacggcata 6540 
tagttgtata atggctttta attcactttg ccaatcactg tcatttaata tttaaaccat 6600 
ttacatttaa tataattatt aacatgctag agtttgtatt atttttatag ggttgctata 6660 
acaaaatact accaactggg tggcttagaa caaaaattta ttttctcaca ttctggtggc 6720 
cagaagtctg agatcaagtg agatttggaa ggtagaaaaa ccaacaaaat atgttaatga 6780 
tgttaccatt gtgggcaact gggactcagt ccttctgagg gccatccgag aaatagggtg 6840 
gaaccatatt tgattgttcc acttagaagg aagagtccag accatttatc ccatcagtta 6900 
ctatccctac agactgagga tgctgattca cttgcacatc tgggtttcat cagtgtcagg 6960 
ggaagcacag aagaaaaggt gcaggcactt aaggtgggaa gctgtgaata cgtccgtgca 7020 
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gggctctagg gcgtgcggga ctcagctggc agcagcccca cgtctctctg gttctaaagc 8820 
ccagtccatt tctgctcagc agggagatgg ccagttcccc agaggacttg cccagggtcc 8880 
cagccgtggc cctgggaggg tgcaggggtt ggtgagggag tatcccgagg ctgtgggccc 894 0 
tgcctctgtg gcttggggtg cagatggaca ccaccctgcc ctgtcctctc gagtcctctg 9000 
ggctcatctg agtgggcgcg ttcgggtggt gcaaactctg gatggcatcg tgtgtctttt 9060 
cttctctggc tggcactgac ttgctgttca ccagatgcca gcaatgagga cactccccct 9120 
cccggatgca ggactggttg ccaccatggg gaggggtgca ccgtgccacg tgctcccagg 9180 
acactggcca gggggctata aagaacatct cgagaggagc cagcacagcc ttgttcagac 9240 
gcccagtgac ctgccgaggt cggcagcaca gagctctgga gatgaagacc ctgttcctgg 9300 
gtgtcacgct cggcctggcc gctgccctgt ccttcaccct ggaggaggag gatgtgagct 9360 
gggttggcgt gggcggatgg aggagccagg cggactcctg ggcagggggc agtgccaggg 9420 
gccctgcttt aggaggtgtc actgaaggct ggcttctgac cccgtgctcc cagcctgggg 9480 
tggtggtgga ggggtcctat tgttcctcca gcagacacgg ccccccgagg cccagggccg 954 0 
gagcaggctcgtctcagggg ttcctgctgc actgacgcct gaagcccgaa ggtctcgcag 96O0 
ggttgggccc tgtggaggga gggctcacct ggtgctgggg cccgggggtc catggggtgc 9660 
agacatgccc tccttccact gggggctggg agccctgagc agggggctgg ctctaactca 9720 
ctccagctga gctctaacta aggtgcagga acacagcctg cctttaaggg cagcagccgg 9780 
gcaccatggg tgtctggtta tagctgcagg cctgagtgcc agggtcagag tagaatctgg 9840 
gccacccatg gtgggctcac ggccttggcc tgctccagat cacagggacc tggtacgtga 9900 
aggccatggt ggtcgataag gactttccgg aggacaggag gcccaggaag gtgtccccag 9960 
tgaaggtgac agccctgggc ggtgggaagt tggaagccac gttcaccttc atgtgagtgt 10020 
tgcccactgc agggcccctc aggccacttt cgctcccctc cccagaccca cctggtgccc 10080 
attgccccat ccacgtttcg ggtgttggga agagtcaccc cctgccttgg agggaaacag 10140 
cctgggcatc ctgaagctcg gtggggtggg ggacagtgga attt tcaggt tgccgggtca 10200 
gggccatgca ccaggtgagc tgaggatggg ccatgggtgt cctgggagcc gctgcccgcg 10260 
tgtctcctgt tttccaggag ggaggatcgg tgcatccaga agaaaatcct gatgcggaag 10320 
acggaggagc ctggcaaata cagcgcctgt gagcccctcc cccactccca cccccaccct 10380 
cccccaccgc caaccccagt gcaccagcct ccacaggtag agagtgccca ggctgccctt 104 40 
ttgccagggc cccagctctg cccacctcca aggaggggct ggcctctcct tcctgggggg 10500 
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ctggtggccc tgacatcaga caccgggtgt gacaggcttg tccgcagtcg agatggacca 10560 
gatcacgcct gccctctggg aggccctagc cattgacaca ttgaggaagc tgaggattgg 10620 
gacaaggagg ccaaggatta ggtacgggga ggctgagggt tagggatggg gaagctgagg 10680 
gttaaggatg gggaagctga gggttaagga tggggaggct gagggttagg gacgggggcc 10740 
aagggttagg gatcgggaag ctgagggtta gggatgagga ggccgagggt tagggatggg 10800 
gaggccgagg gttagtgctg cggaagctga ggattagaga tggggaggct gagggggaag 10860 
atgagggtta gggacaggga agtggggcag gagcagctgc tggagctggg aaggccggac 10920 
tctagtcctg gacgtgctct ggccttgtgg ctccattact tgcattggga ccttcctgag 10980 
aggaggctcc tgcctccgtg tccgggtcca cactgtgcgg agcagccagg cctggctcag 11040 
gctgtccagg gcacctgggt gaccgctgaa acattcccga gtgtttcttc atgtggcccc 11100 
gagtgttctc tctgggaatg acccattttc tctgtcatct acagatgggg gcaggaagct 11160 
catgtacctg caggagctgc ccaggaggga ccactacatc ttttactgca aagaccagca 11220 
ccatgggggc ctgctccaca tgggaaagct tgtgggtgag gggcttgctg gggtctgcgt 11280 
gtcctgcccc acggtctctg cctctggaag ccggtggaac cagcaccacc atgccctggc 11340 
atgggggaaa aggaagcccc ctgtgccggc tttcatgtgc cgggcaccgc tgggcagccg 11400 
gtccaagtac ctgaagtggg aatgggatgg gccaggcagg gacagacatg gccctcagtg 11460 
acatgaacct gccaagggcc acttctgggg tctcgggtgt aggagggtca ccttaagggg 11520 
gagggtcacc ttaagcgtcc taagagagct cttcatgggg cagggaccct ccagggcagg 11580 
agggtggccg gtgcctctgg gagacaaggt ccctccaggt gagggctgtg accctgcagg 11640 
gtggcattgg gagctgccca ggtcccctgg ggttgccgag tggcttggag agtccccgcc 11700 
actgtccccc ttcctgggga cctctcatct gggcagtgac tgtctcctct gtccccagtc 11760 
ccacccctga gctctcatcc attctcaagt ctcctctcct gcttgtccgg tgctgggcta 11820 
tggccatctc ttctgtcccc ggccccaccc cagagctctg gtccactctt gggtctctcc 11880 
ccttgtcctg gtgctgggca gtggccatct cctctgtccc cagccccacc cccgagctct 11940 
gatccactct caggcctctc cccctgtcct ggtgctgggc cgtggtcgtc tcctttcggc 12000 
atctgctccc tgcagggccg tgccgctgtc cccacgtcgg ctcacctggc cacctcacct 12060 
gcaggtagga attctgatac caaccgggag gccctggaag aatttaagaa attggtgcag 12120 
cgcaagggac tctcggagga ggacattttc acgcccctgc agacgggtga ggatggctgt 12180 
gcccagtccc ctgtgtccct ctgctgtgtc tgtctgctat ctccagtgtc ccatgacccc 12240 
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catgtcctcc catgtccccc gcattcccca 
gggccctgtt tagcgtcctc ctcattggag 
gggctccgcg ctctgggctg cgatggggtc 
tctggggctc cgcgctctgg gctgcgatgg 
ggggtctggg ccctggtcta gggcgccttc 
gaagctgcgt tcccgaacac tagggtgagt 
gagtcctggg ttcccggggt tcgagggtac 
ccagggagac ccccccaggg tcaggcacga 
gttcctggga cattcgggaa gtcctttgtt 
agggtcctcc tcggcctttc cacagcgagg 
agcccatcgc cccctgcacc tgtccccacg 
gatgctcaga ggcctgccca gtcattggag 
gcccagcagt ggccgatgtg ggggctgcag 
ctgccctacc ctgcagcctc cctgacctct 
tctgcacctc cagagcccac cctaccacca 
ctccagccat gacccctccc tgctcccacc 
cagctctggg caggcctatc tgtggggaca 
tgctgggagg gggagccaca agggaagctg 
cacaagaggc acgatgtcca acaggcctga 
ggaagagccg ccctcaggtc tcaggttcag 
gtttggaaag ccactgggga ggacaggagc 
ggccaccgca gggctcccgc ctcaggggtc 
agcagcctgg gctcagtggc agcactggag 



27 

tgtgccccga gtctcctcgc aggggctccc 12300 
gctctgtgct ctgggctgcg atggggtctg 12360 
tggggctccg cactctgggc tgcgatgggg 12420 
gctctggggc tctgagctct gggctgtgat 124 80 
taatccctgg gtttttcttg gtctctgcag 12540 
gagcctttag gagggcactg gacaagccca 12600 
atctgctctg gcccctccca tcccacacag 12660 
ggttggcacc tcagagtctg cccacccaca 12720 
ttatcattcc tgcacctgcc agcccgagtg 12780 
cctccctccg gctccctcag gtgtcagctc 12840 
cggtccactc cccaccatcc acccaccaag 12900 
gagctgaggt cgctctggag ccccgaggtt 12960 
agcctgggga gggagctctg gccctggcct 13020 
gctcctcttc ccagcacagc agcccccggg 13080 
gacacagagc ccggaccacc tggacctacc 13140 
cacctgactc caaataaagt ccttctcccc 13200 
gaggggcttg cacccgctcc ctgggaggtc 13260 
gggagatgtt ggacctagca agggccaaac 13320 
ggggctgcgt gatgtcctgg aggggcctcc 13380 
cttaggacag gcagggccct gggcaggagc 13440 
ggggacacgg cgtcagggct gcagcagtgg 13500 
tcagagggat cctccaagct cccccatttt 13560 
a 13591 
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